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AGCACACACA GGCGCGTCCT 
CACCATTCCC CCTATCACCG 
TTGCTTCGAT ACCCTGCTAC 
ATCGCAGGAA ATTGCCCGCT 
CCTGCTCGAT TCAGTGCGCG 
GCTCAGTGGC CAAGCAGAAA 
GCTGCGCTCT TGTTCCTGCG 
AGATGC CCTC TCTGCTACCT 
CGCGTTTAGG GTGTCTTCGA 
TTGCCAACAC AAGGGATTGG 
GCCGTTCCCC CAGGATCCAT 
CTGCACGAGG CAAATATCGC 
GCCATCATGG CGCGCGCGTG 
GTTGCACAGT ACGTGCTCCG 
GCACTCCTCG ATGCTGAAAs 
ATCCCGACGT GCGCGCGCTG 
CCAGTACAGC TGAACAGAGT 
ACTCAAGTGA CGGTATCCGT 
GTGCAGCTGC TGCGCTCGGA 
TCGGATCCGA CCGCTTCCCA 
AGGCAATGAG AGGACTCCCC 
TGCCAGACCC TGCGCGAATG 
CTTCGGAGCG CAATCCTCTT 
AGCTCCTGAA AGTGCAGCTT 
AGGGnACTGC GCATTCTCAT 
GACCTCATCT CTGAGGTAGC 
GTAGCACTCG GCATTATGAT 
CCCACGTGGA TTTTTTTTCC 
ATCGAGAAAA CGAACAGGTC 



TCCTCTTTTT 
GCATTATTGC 
CTACCCGCTC 
TGCGCAACGC 
CAAAAGCATC 
TGCTGGCCGA 
ATGCAGAAAC 
CAGACGAGTA 
CGCACTTGCG 
AAATAGCACC 
CGTGGTTGCC 
TGGTTTGGTA 
GAGTC TTCCC 
TGTGCGGCAA 
AGTGGGGGAA 
CgCACrCGCA 
CCCCCGGCCG 
TTTGAAGTCG 
GCAGCAGGCA 
GATGAAGAGA 
GTCGTGCTTC 
TGCGCACTCT 
TTAGGGTTAC 
CGTGCAATGt 
CCCCATGGTT 
CGAcGAgTGT 
CGAAACGCCC 
ATAGGGACGA 
AGCAGCTATG 
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GGGAGGATCC GCTATGCTCA GAGCCATGCG 120 

CTCAGCAGGA TGCGCCATCG GTCCAGTCTT 180 

CCGCCCTGGC GGATCCCGCC TGCGCCCCCC 24 0 

ACTTTCATAT GCGCGCGCCT CGCTGCAGAA 300 

CGGAGACGAG CCTGCACCCG AGTATGCCGT 360 

CGCGGCCTTC ATAGCTACCG TAGAGGAAAC 420 

TGCTTTGCGC AAGGCAATTA CCCACGTGAC 480 

CCTGCGTGCC CGGGCAGCCG ATATCCGAGA 540 

CATGACACCA CACCCACCGC AGnAAGCTCT 600 

CCACACTCCC CCTGGGAGCC TGACTTTAGC 660 

GCTCACGTAC AACC TGCGCA CGCACTGCGC 720 

AC CG AAGTGG sCAGCGTAAC AAGCCATGTC 780 

CTGCTCGTCA GTGCACAGGG ATGTAAAGAC 840 

ACTGCTCGTG CCACCGATGA GGCGCTGCGC 900 

AAACTGACGC TCTAGGAACC CTCACCGTAA 960 

TGCCTcACCC TTTCCTCACC GTCAAACACA 1020 

CCTGTGTGCT AAACGCACCG CTGCGCACTT 1080 

GGGCAAATAT CGTTATGCCC CAGGAAGCGT 1140 

TCGGACTGTT CCGTTCGGAG TTCTTGCTAT 1200 

CGCAgTGCTC TGCCTACACG CGCGCGCTGC 12 60 

GAACGTTTGA CCTTGGTGCA G AC AAAC TGG 1320 

CGGAC GCTGC TGAACCGTGT GCACACACCG 1380 

GAGGCATCCG CTACTGCCTC GCACATCCTG 1440 

CCGCGCCGGA rCkTGCGCAA CATGTGCAGA 1500 

TCACGGGTGG AAGAAATTCA CGCCGTCGCC 1560 

GCCCGCGCGC ACGTGAGTAC AC CCG ATCGG 1620 

GC TTCGGC AC TGATGGCAGC AGAtTCGCTC 1680 

ACGACTTAAC CCAGTACGTG TTCGCCGCCG 1740 

CCGATTACTT CCACCCGGCA CTCCTCCGTC 1800 
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TTATCCAGCA CGTAATACAT 
GAGAACAGGG AATCGGACGC 
CTTTCTTCTG GCGGGGCTCG 
GCTGCACACG TTCTTATCAC 
CGTGCAGCTT TCAGATGCGC 
AGGTATTACG CTTGAGAAAG 
GAGGCCTCAG GCGTTTTCTC 
GGTCCACAAA TCACCGCCCT 
TGCCGCATAC GAGGCGCGCA 
GAGCGAACGC GCGCGTGCCG 
TGAGCCGTGG CGTTGCGcTG 
CCCGCGAGGC GCAAGACGCA 
CTGCTCGTTT CGAAGAGGCA 
ACGCGTTTGT TACCATCCAC 
TGCTCATGCG CATGTACACG 
ACTTACTTGA GTCAGAAGGG 
CCTTTGGTTT TCTCAAGGGA 
ACTCTGCCGC GCGCAGACAT 
ATCACGTTGA GGTGCACATA 
gAGCAGGCGG TCAACATGTC 
CAGGgATAGT AGTCACCTGC 
AGCTTGTTAC GCGCCCGCCT 
CGGTTTGCTT CTGAAAAGAA 
CATCCCTACA CCATGGTTAA 
TCATGGACGG AGCGTTAGAA 
CCCAGTGTGT AGAACCACAG 
CTTTCCCCAA ATCGCGGTCG 
CTGTCCGCGC GTCAGTCcAC 
CACACCCTCT TTTCGCAACC 




417 

GCGCACAGAC ATCTGCGGCA 
GTGGTCATGT GCGGCGCCAT 
GCCTGCGAGC GTTGAGTGTG 
GCATTTCAGT CTCTGATGCA 
AGTCAGTCCG CACACTCATC 
ACGAGGAAGA ACCCTCACCC 
CATACACAGA AAGGAAAAGG 
CGAGGCGCGC GTGCAGGAAG 
TAGCAACGCT TGAgGCTGCT 
AAGCGCTGTT AGCGGAACTG 
CGCCGTGAGA GCGCAGATCT 
TCGCTGGAGC CAGAACTTTC 
TCGCTTACCC GTCTCC TGC A 
TCCGGCGCAG GAGGAGTGGA 
CGCTGGGCAG AGCGGCGCAG 
GGAGTAAAAT CGGTGACGTT 
GAAACGGGGG TACACCGGCT 
ACCTCTTTTA CCTCCACCTA 
CGGAGCGAAG ACATGCGGGT 
AATAAAACGG ACTCTGCCGT 
CAGAACGAGC GCACCAAATC 
GTACGCCTAT GAACGGCAAA 
GGATATTTCG TGGGGAAATC 
AGATCACCGC AGCAAGTGCG 
CCGTTCATCC GTTCCTACTT 
TGAACGGGAG TTACGCGCAA 
g TTTAGTGC A AnGGCACCGG 
CCTCCTGCCT CTTCCTTTCT 
GCCCGTACGA CTGCTGCGCA 




13041 



ACGTCCCGGT ATTTCTTTTG 1860 

GGCTGAAGAT GAAA t GCGCT 1920 

CCTTCTTCAC GCATCGAGAC 1980 

GAGCACTGTG CACGTGCAGC 2040 

GAAGAACATC TGCGCACCGC 2100 

CCTCGATCCC CATAGCGGAG 2160 

CAATGGAAAT CGAAGAATTT 2220 

TATGGGGGAG TCTTTGACGT 2280 

GCAGCAGCGC CTGACTTTTG 2340 

AAAAAACTAC GCGCAACGCT 2 400 

GCGCGCGTTG TACGAGCTTG 24 60 

CTCCCTTTTT TCAGACATTT 2520 

CGAAGAGGTA GACCGCCTCG 2580 

GGCCTGCGAC TGGGCACAGA 2 640 

CTTTTGCGTA CACATAGTTG 27 00 

AAAAATTTGC GGGTCACACG 2760 

CGTGCGCATC AGTCCGTTTG 2 820 

CGTCTTCCCC GTATTAGACG 2 880 

AGATACCTAC CGCTCAGGGG 2940 

GCGCATCACG CATCTGCCTA 3000 

AGCAACCGTG CAAgGCGCTG 3060 

AAAAACAGCA GGAACATCAA 3120 

AGATTCGCTC GTACGTCTTT 3180 

AAACGGGGAA TATTCACGCA 3240 

GGAGTTTCTG TGTACCAGTA 3300 

TCATTTGCAG CACTGCTTTT 3360 

cGCCGTCCTT TGACTCTTTC 3420 

AGCATCACCT GCAGCGCCGA 3480 

cTGCTGtCTA CGCCTCCTGC 3540 



Printed from Mimosa 02/03/22 07:24:21 Page: 419 



WO 98/59034 



# 



PCT/ 




418 



nCCCGCCCGT CCCCGCCCGC GCCCCGATCA CGTAATCCCC AAGGAAAAGT GGCGCCTTGC 
CGTTGCAGAC TTTACCTTTC ACGGTATTCC AAAGATTTTT CAGCGCTACG TGCGTCCTGC 
GCGGGAGctA CTC TTTATTG AACTAAAAAA ATTACCCCTC CGTCATTTTC TTTCTGAAGC 
TGAACAGCGC GAG c GCGCCG CCTTGCCCCA CGAAGAAGCC TACCACGCCC GGCTCAAAGA 
ACGTGCACAT TTACAGCGsG CGCGTGATTT TGTTTCCTTG CACCCTGTCA GCGATCACGC 
GCGCCGTCTG CGTACGGCAG CATTTGAAAA GCAAATCAAA GAGAAGGAGC AAGAAATCGA 
GCGTGCCCGT GTGGAAGTGC GCACgCACGC GCGCGGTTTT TCCGTCCCTG GCTCCAGGCA 
GAGGTGCTCG TCTTAGGTGC GCAAAACGAA CCGCATGCAC TGCCTGAGCG CTTTCACCTT 
GCCACCCATT TACGGCAAAA AAAACTTTCT GCACTGGTTA CGGGAAAACT CGTAGACGTC 
GCCGGTTACG TGCGCATATC TCTCTATCTT TCTACAGGGC TAGAAGCAGA ACCCACGCGG 
GAATTCACGC TCGCAGGTCC CTACCGAGAA CTGCCGCGTC TTATGCACAC GCTGTCTGCA 
CAATTGCGCA GTGCCATTGA AAACGCACAA CCGGTGCGCA TTGTGTTTGA CGTACATCCT 
CCGCATGCAC GTCTTTCGTT TCAGGGCGTG CCGGT AG AAG ACCTTTCCAA ACCTCTTATC 
TCATACCCGG GCCGCTACGT GGTGGACGTG TCTGCTGCAG GATACTTTTC TGCCACAAAG 
GAAATATACA TTGAAAACCG ACCTGCCTTT TCACTACGGG TGCGTTT AG T TGCCCGTCCA 
CAACATCGTG TGCGCGTGCA GCTTACTGAC AACAGCGCAG cACCTATCTT TTCTGGCGCA 
CGCTCAGTGG GAGTCACTCC CTTCAGCACC GTGGTTACTG ACTTGCGCGA AATTTTC AC C 
GTCGGACCGG CAGGCGCGCG TTCGTTTGCC TTCATTGAAC GCGGCACATT TCCTAACTCT 
CAGCCGAGCA CGCTCGTGTT GCCTGCGCCT AACCCAAACG CAACACAGGA TCTTGCGTAC 
AAAAGGGACG TAGCATACTG GTCTTTTGGA GCCCTCTGCA TTGCCGTTCC CATCGCGCTC 
ATTCTCGGCT CCACGCTTGC AGACACGCAT CAGGCGCTAG AACGCGCAAA AGCTGCAAgC 
GCGgCAACCT CCTCCCCCTC CTGCACCGGC CGGCACGGGC GCATTAGAAC GTAAAAGCCA 
GCACCTGCTC ATCGGCACGG GGGTAGCAGT AGGAGTGGCG GTTATCCTGA GCATTAATTT 
CATCGTGCcA CTGCGCGCTA TTTGAACGCG GTGATGCACA ACGCGCCACA GGCAGTACGT 
CCCCGCGCGG ACAAAGACAT ACAAACATTA ACGCACCGCG ACGAGGCAGA AGAAGATCAG 
GAAGAAGATT CCTAAAGGAG CGTGAAGGTG GGTTTAGGAA ACCTAGCACA GAAAATACGA 
CGCCTGCTCG GTGGACAGGC GCCTCTGGAC GAAACGTTTT TTAGCGCGCT TGAAGAGCTG 
CTCATCGAAG GCGAC CTGAG TCTTTCGACG GCAGAGAGCT TTTGCACACA GCTTCGAAAC 
GCCGCGCGCA CACGTTCTGT ACATACGGAA GACGCAtGCG CACGCTCTTT GCGGAAATTA 



3600 
3660 
3720 
3780 
3840 
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3960 
4020 
4080 
4140 
4200 
4260 
4320 
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TGGAATCGTG CGTACGCGTT ACCCATCTTG CACCAAATCC GAACCAGTGC TCACTGTATC 
TCCTACTTGG GGTTAACGGG AGCGGGAAGA CCACTTCTGC TGCAAAGTTG CAGCGTACTA 
TCAGACCCAG AAGGTGCATC CGATACTGTT TGCCGCCGCA GATACGTTCC GCGCAgCAGC 
GGCAGAACAA CTCGCACACC ACGGTGCACA GCTAGGCGTG CGCGTCATTG CGCACCCGGG 
GGGAAAAGAT CCTGCTGCaG TGGTATTTGA CGCAGGAGAA GCCTTGCgcG cGCAAAAGCG 
S GGTC TTTTA CTCGTTGACA CCGCAGGGCG ACTGCACAAT AAGACGCACC TCATAGCGGA 
GCTGCAAAAG ATCGACCGTA TTGCGCAGAC AAAGGTGAGC GCAGATGCAT AC C GC AAG AT 
ATTGGTATTA GATGCCACCA CCGGTCAAAA TGCATTTCGT CAAGCGCAAA C t TTCACGAA 
GCTATTGGCG TGGATGC ACT GCTCCTTGCA AAATGCGACA CACGCGCACG AGGGGGAGCA 
GTTTTTTCCA TCATGCAAGA GTTAGGTATT CCATTAGCCT TTTTAGGGTG GGGGGAGCGC 
TATACAGACT TGGTTGAAGC GAACGCGCGC GAGTTTGTTT CCTCGTTCCT GCACGGAGAA 
CGATGATTCG ACCCCGGTAT GGCTGGATGT ACAGCAGCGG GATTGCAGTG CACCTGTGTG 
CgGCCgTGTG CGCGCACAGT GCTGTTCCTG CCGCGTGGAC CTTTGCAGAA CAGACACAGG 
CGCAAAAAAC AGACACTCCG CTTGATTCCT CCaGtACGCA tGaCCTCCCC TGAGGAAGCA 
CCCAATGAAG CAGATCCGTT TGAGAAGGAA CTGGaACACG CGTTCGAAAG AGCGCACGTC 
AGCACAGGCG GTGCAGATTC CTCATCACAC GCCGATTTTG TACACATGGA AGAGGCAGGA 
CGTGCCCACG CGTCCGCCAA TCGCTGGTAT CACGAAACGT TTGACTCGCG TCAGCGTCCA 
TCCTC TGC AG TTCTGTACGA AGGGGCACAG CTACTGCATA CCGTTCACTG GCACTATGTC 
gGGGACCGGC TGTTTCCCTG TGAAAAAATA ATTACCACAC . CACACACACG TATnCCGCGC 
GCGCTATAAT TTTTCCGGAA AGATCGTCGC GTACGAAATG CACACGCGCG GGGTACTGGT 
ATACGCACGC ACCTATCGGT ATGnAnCGCA CGCGCGTATA TGTGAAAAGG AAGAAACAAC 
TGCTCGAGGG AATGAACGCA TTACGTATGA 
(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19483 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 
TTTTTGCGCG CGTTCTAGCA CCCGAGTnAA TAGTGTTTTT TGAAAAATGG AGGnTCGCGT 
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CTACCCAGTT GTAAAAAGAG tGTTCGCGCG CGTCCgTGCT CTCCACACGA AcGGAcTCCG 
TCCACTCACG AAAGATATCG TGTCCGAAGT ACAATACGAG CATCATAACG TAAACTGCAC 
AGGGTGTTGT GTACgtsTGt GCGCACATGT AGC ACCTTTC CCATACGGAG ACACAGGGAG 
AAAAGTTCCA AACGGGATGT GCACGCGTAA AAAAGTGAAG ATGCGCACAG CAATAAAAAT 
AGGCGGATGC CTATGTGGTA TCCTGCATGT ATGTGCTGGT TGATTTGTGT AGCGCTAACA 
ACACCGGTGC TGAAAAATGT GTGCAGGATC TTCAGAGAAA AGAAAAAAGC GAGGTAGATA 
CTAACTACAC GTATGTGTGC AGCCACGCAT TTCAGAGAAC GTGTGACAAT GAGCGTTAAT 
GCGAGCAGCA CAAGTGTAAG CGACGCgTGC ATACACCAAC TCTGTGCATT ACAGCACAGG 
AGAAGGAGAG CGAAgcTTGC GACTTTCGCT ACGGGAGGAG CACGGTGAAG CAGCGATTGC 
CGGCGTTCAT AAAGAGAGAA AAACATACAT TTCCTTGACT CCTCCTAGCA GGTGGAAGTA 
CTGCATGCAT GTCGCTACCT AGGTCCAAAG GAGATCTGAA AGAGTGAGTG GACAGTGCAA 
AGGATCTCTT AGTC CGTAGC AAGAGAAAAT TTTACTGTCG AGTGCGTCCT GTGGCGTGCC 
ATCGTAGGAA ATGACCCCCT TAGAAAGGAT GCATAGACGC GTCGCAGCAG CGAGTATTTT 
TTCAACCTCA TGGGTGATGA TAACAAGCGT TTTACCTGCG TGTTTGAGGC TTATGATGAG 
CTGCACAACC TGACGAACGC TGGGGTAATC TAAGTTTGCA AACGGCTCGT CAAGAATGAC 
TACCTTTGCA TCCAAGGCGA GTACGCCGGn CAACGGTTAG GCGTCTTTTT TCTCCACCTG 
AAAGCGCTCG GGCGTAATGG TCACGCCGGT CAAGCAGTGA CACGGcTGCA AGTGCGCTGT 
TGGTACGTGC GTCAATTTCT GCGCGGGAAT ATCCCCACTG CAGAGGACCG AAGgCGCAGT 
CCTCAAATAC CGTTTCGCCT AGGATCTGGG TGTCTGCATT TTGAAACGCC AGACCGACAG 
TAGTCCCGCG TGCCATATAC ACACGGCCGG AGGAcGGCGG TTCAAGTCCT GCAAGaTGTT 
CATGAGCACA GTTTTACCCG AGCCATTTGC ACCTGCGAGG ACGACACAGT CCCCAGGAAA 
CACCCTAAAC GAAACGGAGT GTnAATACTT CACAGTCGCG CTCAAAAGAC TT AC TT AC AT 
TGACCAGTTC AAGCAGCGGT CCTGCGCACG ACTCCACAGC CGTGTCTGCC GCGACATCTG 
CGCTCATGCG TCCACAGAAG ATTCACCGTG CCCTGTGGCG CGTACACACC CGCGACACGC 
GAATACTATG GCATCAACCA TGACGGTACA AATGGCGGCG AACTGTAGGG GCAAGATGAT 
GGGCGAGTAG GACAACAAGG GCGATTTTCA GGGTGTCGGC AAGAAAAAAG GGAAGGAAGA 
ATCCTAGCAT GAGCTCCCCG GTCtTGAGGC CAAGCACGTA AcCGAGAACC GgCAGACCGA 
TGGAGTAAAT CGAAAGAAAA CCAACGAGCG TTGCGACGnT AAGTCTGATC CAAAGAAGGA 
GTGCGCGTTC CACCACCGCG TGGTGGAGTG CAAGACAGGC GGTGGTGCTG CGCGATTGCc 
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180 
240 
300 
360 
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900 
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CAcGAGCGTA GCAGCGAGTA TGTATCCAAG GAGGAATCCT CCCGTAGGGc AAAAAGCGCG 
GTGTATCCGC CCCGACCTCC TGAAAAAACC GGCAGACCAA GGAGTCCTGC CCCGAGGAAG 
CTGAGAACGG CGAGTGCACC GTCTCGCGGT CCCAACAATA AACCGGTGAG AACGGCCGCT 
GCATTCTGCA GTACAAGCGG AACAGGCTTG AGAGGAATGC TAACGAGCGC ACTCGAGCTA 
ATGAGTGCGG CAAAAAGCGC AACAAAAGCC AAAGACTTAC TACGGTGCAT GGTACAGTAC 
TCCTCCGAAG GTTCGATGCG CAGTGTGACA CGGAAGGAGG AATCTTTCAA TATCTTGGGT 
GGTGCCACAG GTATAGTTTT TAACAGACTT ACCCGAACGG CTGCCAGGTG CGTACGCGTC 
GGTTCGTTCC CACTGTGCGC GGGCAGAGCT CGTGTAGTGT CTATTGACAG ATGCAAGGAT 
CGGGTACCGT CATGTACGCA gTTTATGGTA GGCTCAGTCC TATGCACGCT GGGGACAGAG 
AGAGTATCGC ACGCTTCGTG cGTGTGGTGC GCGATTGTCT GGATTTGTTT CGCACCGAGG 
GTATTGGGCC CCGTCCTAGG AATGATTCGG TAATTTTACC GAATGCTGCG TGTTCACCGC 
GTAATCATGC AGGAAAGCGT GCGCAGAGCA CTGCCGATGC GTGTGTGAGA AGCAGTGACG 
GGTCTGTATA CACGGACGAA ACCTTGCGCG AGGAAATTTT TGCATGCCGT GCGTGTGAAT 
TGTATCAACG GCGTACACAT GCGGTGGTGG GAGAGGGTGT TGCAGACGCA GACGTGCTCG 
TCGTTGGGGA GGCCCCTGGA GCGGAAGAAG ATCGAAGCGG TCGTCCGTTC GTAGGACGGT 
CAGGTAAATT GCTGGACGCA ATGCTTGCGG CGATTGGACT TTCGCGTCAg cAAAATTGTT 
ATATCACCAA TGTGGTTAAG TGCCGGCCGC CAAGGAACCG CACACCAACA CCCCACGAGA 
CTGCCTGTTG TGCACGGTTC CTCCATGCGC ATCTTACGCT GCATCGCCCG TGTGCTATTT 
TGGTGCTCGG CCGcTGCGCC GCACAGCACA TGCTCCAAAC AACCGATGGT ATTGGCAAGT 
TGCGCGGgCG CTTTTTTACC TA t C AGGGg A TtCCCCTTCT GGcTAcGTAC CATCCGAGTG 
CGTTGTTACG GGATGAAGCG CTGAAACGTC CGGCGTGGGA GGATCTCAAA ACGTTTCGTG 
CACGGTTGCT GCAGTTGAAG CAGGACGCAC ACATGCCAAT ATAAAATCAT GGCGCCGTGG 
CTTGAGCTTG TTTTTGAcgT TCCACTGGAT AAAAGCTTTA CGTACCGTGC GTGTGCTGCC 
CACGCGGGTG AgGCACTCGT GGGTAGACGG GTTCTTGCTC CCTTTGGGGC GCGTACACTC 
ATTGGATTTG TGATAAGTGA ATCACATTCT TCGCCTGCTG ATTGCGGTGG TGCAGTTGGC 
ACGTTCAAGG AGATCATCCG CGTCATTGAC AGGGAAGCGC TTTTTGACCA AACGCATCTT 
GCGTGTGCGC GTTGGATGGC GCATTTCTAC CTGTGTGCCT TAGGTCAGGC GCTGTGTGCG 
GTGGTTCCGT CTCGGAAACG AGAACGGACA TTGTCTTCTT TTGCTTCTTG TGCGGGTGTT 
CGGCGCACTG AC AC C TATGC GCTTTCGGGC GAACAGCGCA AGGCGATTGA TGCGATTACC 
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GCGAGCACCG GTGCGCGCAg 
GTGTTCTTGC GCGCACCGAG 
CTGAGATAGC GCTCACTCAC 
CGGCGGTGTT GCACTCAGCG 
AGTGCATGCG TCACTGTGTA 
GGCTGGGCCT TGTGATAATG 
CGCGCTATCA TGCGCGgCAG 
TCATGGGGTC TGCAACACCG 
GTCGTTTACC ATTGACTGCG 
CGTGTCAAAA GAGGCCCTGT 
GGAGGCAGGA TATCAATCGA 
GTGTCGCAGC TGTGGATACA 
AAACGTGTGG GGGCAATGCA 
TGTCCGTGCT GTCATTCATT 
GAAGCAGTAC AAGC GCTATT 
CGCTCAGGGC ACGTGCAGCA 
TTGGGTACGC AAATGATAGC 
GCCTGCGCAG AT AC TGGACT 
TTGATGATGC AAGTGGCCGG 
CAAACACGCA ATCCTGCGCA 
TTTATGCGCA AGAACTTGCG 
TTCGGTTTGT TTTTCGCAGC 
ATGCGCTTTT GACGGCGCAG 
TGGTGGCGCA GGTGGCAGGC 
CAGTGGTGCA GCAGGTGGCG 
ACGTAGAATC TG AC GTAGAT 
TATCCTGCTG TTTGCGTGTT 
ACGGAAAGGA GAGAGGATGT 
GTTGATAAGC CGGCAGGACT 




422 

TTTTTATGTG CACGGGGTGA 
GCAGTCCTTG CGCGTGGCAA 
CAGGTGCTCC AGGAGGTATA 
CTCAGTGGCA GTCAGCGCCT 
GTGATTGGAG CTCGGAGTGC 
GATGAAGAAC ATGACAGTTC 
GTAGCGATGT ATCGCTGTGC 
TCTGTGGAGG CCTGGTACGC 
CGTGTTGCGG GGGGG cTCCG 
TGCTCTCTAC CCGTCTGGTG 
TGCTCTTTTT GAATCGTCGA 
CGCTGTGTTG CACGCAgTGg 
ATGTCATTAC TGTGGCAGGC 
TGATACCCGA TACGGCGGGG 
TCCTGAATAC CGTATTGCAC 
GACGATGGAG CAGTTTCGCG 
AAAGGGATTT AATTTCCCTA 
GCACACGCCA GACTTTCGCG 
ACGTGCAGGT CGCTATGTAG 
TCtGCGGTGG TGTGTGCGCa 
CAgCGGGAGG CGCTGTGTTT 
AAG AC GCGGC GCAAGGCTAA 
ATGCCTCTGG GTGCGGATGT 
AGCTATCGGA TGCAAATACT 
CGCAGCTTTT TAGATGAATT 
CCTGTAAATG T AC TGTAGGG 
TGGTTGACCG GTAGTATGCG 
GGCACTGCCG ATTATTTTTC 
TGCAGTACAG CCGGGTGCGC 
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CAGGGTCGGG GAAGACGGAA 3 600 

GTCGGTTATC TATCTTGTTC 3660 

TGTGCGCTTT GGCAGTCAGG 37 2 0 

AGGTGAGTGG CGGCGCATAC 3780 

AATTTTTGCT CCGTTGAAGC 3840 

G TAT AAG TC T GCGCATGTGC 3900 

GGACGCGAAC TGTCCGTTTG 3960 

GATGCTGCGG GGGGCGGTGC 402 0 

CCGCGTGTTG AGGTGGTGGA 4080 

GATGAAATAC GCAAGACGAA 4140 

GGATTTTCCT ATTCGTTTCA 4200 

CAGTTCCCTT GACG TGGC AC 42 60 

AAGAGGCGCC GCCTGAAAGT 432 0 

TGGGCACAGA GTATATTGAG 4380 

GGGTGGACAC CGAtGCGCTG 4440 

CGGGGAAAAT CGATGTACTG 4500 

CGCTGCGTTT AGTGGGTATT 4 560 

CCGCCGAGCG GAGTTTTGCC 462 0 

ATAACGGCCT GGTCATCATC 4 680 

GCACGGGGAT TGTGAGTCCT 4740 

TCCGCCCTTT GTGCGCCTTA 4800 

AGACGCCGCG TATGCGGCAC 4 860 

ACTGGGACCT GCAGCGTGTG 4920 

GCTGCGTGCC CCATCATTCC 4980 

TCGAGCTCCG GCGGGGGTGT 5040 

CGAGTAGATG TACTCCGTGT 5100 

GTGCCTGGTA TAGGTGCGGG 5160 

AGGACGCAGC gGTGGTGGCC 5220 

GGGTGCGGGT GTGCGTAgTT 5280 
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GACGTATTAC AGAAACAGCT 
ACCGCGGgCG TGCTGCTGTT 
TTAGGCAGCA TGCG TGTGAT 
gTGTGGTGAT AT t CGCGTTC 
gCGTGCCGCG CATACTGCAT 
ACTCACtTGC ACAGTGGTCG 
CTATAATTGG GGATGACAAA 
GGGGAGTAAA AAGG CTCC AG 
CGCTGGTGTT GCGTGCACGT 
TATGATTGCC t GTAGC AGGG 
ACAGAAAGAG TGTCGTGTGA 
GTGTTTTTGC AGGCGCGAGG 
AGATACCGGC GCCTATTGTT 
GGGGATGGCC TATGGGTACT 
AGATGAATCT GCAAAAGATC 
ACATCGATAC TTACAATTCT 
GCTACACGCG TATTGTATTT 
GTTCTTTTAC GGCGTTTCTA 
ATATTCAGCC GAGGGTGTAT 
TTCGCGATTC TATTGCGGAT 
TGGAcACCTT TCCgAAGGTG 
AGCAGGGGCG TTTTCGTTGT 
ACGTGTCTCT CGGTTAGGTG 
TTAGGGAGGT GTTTCGTGCT 
CCGGGCGGCT CGGGGAaGTC 
CCATGGTAAG GTTGAGGTCA 
AGGAGGGGTG GTATGGCAAA 
TGCAAGCGGC GTAATTACAC 
CTCAGGAAGT ATTGTCCTTT 



TGGGGTGCGT 
TGCAAAAmAT 
TAAmGtATCG 
CTATCcGTAC 
ACCGTGTGTT 
GACCCATCAG 
TACGGTGATT 
TTATTCGCAC 
ATGCCTGTAC 
CATTCTGGTA 
ATTTCAATAG 
GGAGGGGAGC 
ATATCGGGCT 
GTTGTTCCGG 
GATGACCTTG 
TCCTTTTTTC 
AACTGCGCCT 
AAAACGGTCA 
GAGGTTTTCC 
GCAGTTAGCC 
TTTTcTTGCC 
TCCGAATGTA 
ACGCGCCTTT 
TGATGTGTGA 
CTGTCTGTGT 
GCGGTTCAAT 
GAGGACGGCG 
CACTTCAAGA 
TGAGCGTAGA 
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CTGTTTCCTC 
GCACGGGCAG 
CGCACTTTGT 
CGGTACGGCa 
GCGTGCGACT 
ATTCGTATTC 
TCGCGCGTAA 
ACAGTCTTGT 
ACTTCCTGCG 
rGCGGTGTGT 
TTTTTCTCTA 
GGTCCCCTGC 
ATTTGTGCTA 
GATTCGATGA 
AAGGTGGCGT 
AAAAGAGGAT 
CTTTGAATTA 
AGCCtAAAGG 
AGTTACTTGG 
TTTTTAGGAA 
CgATCTGCTC 
AGACGATTyT 
CTTTCTGGGG 
GGTGAGGCGT 
TGCTTCTGTA 
CCCGCTCGGA 
GTGGAGCTTA 
AACCGACGTA 
CGTGTGCTGC 



TGCATCGTTT 
CTGCTCTGTA 
TTTGGGCGAC 
GCAAGGCGGC 
GATACGCATA 
ATCTGcTGCG 
CAAGGCGTGT 
GTTGCCATGT 
TGCTCTTGAT 
GGTTTTGAGT 
GGGTGTGTAC 
TGCTGTACTG 
GAGTGTGCGA 
CGAGAAAGAC 
CGTTGTTTTC 
TGCG AAGG TT 
TGTCTCCTCC 
TGGCGATATT 
TTTTTCTCAG 
CAAGgTCTCa 
TAAGAAgTTA 
CGCCCTTGAC 
CAGGCTGGTT 
TATACAATGC 
GCTCAGTTGG 
AGCTTCCGTC 
TTGCGCTTCA 
ACGTTCAGGA 
ATAGAGAGGC 



GGACAAGGAC 
CCAGGGGATT 
CTCCCCGAGA 
GTCAgGTTGT 
CATATCTTGA 
CTAGGATGTC 
GCTCGTGCGT 
GCATGTAAAC 
GCCgTTGcGC 
TCTGCCGGTA 
TGCACTCGTT 
TCTGTAGGGA 
AACCGCTAGT 
GAAAGTCTTA 
CTCAACGGGT 
ATCGATGCAG 
ACTGGAATtG 
GTTCTCCTCG 
TTTTTTAACA 
CCGyTGAAGg 
AAGGCGACTA 
GCGAGCGCAC 
GGCTGCCTGT 
GGGCCGGCCT 
CAGAGCGCAA 
tGTGGATGTG 
GTGCACTGGA 
AAAGC TCGAG 
GAAGATAAAG 
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TAGGCTGTCG TCATATCTGT TACGCACGGG GTTTTCTGGT GTTTTCCGGG GATTTGTGGG 7080 

TCAGTAGCTC T AATGG C AG A GCGTCGGTCT CCAAAACCGA ATGTTGAAGG TTCGAGTCCT 7140 

TCCTGGCCTG AGTGCTTTCG AAAAGGTGTT TCATGTTGAA GTTCGCAAAG TTTCGTAGGG 7200 

AGTGCGTTGC CGAGTTCAGG AGGGTGGTGT GGCCTGCGCG CACTCAGGTA CATACCGCGG 72 60 

TTAAGGTAGT GCTCGTCTCT ACCGTTGTCA TGGCGCTTTT CCTCGGGCTT ATCGATGCTC 7320 

TGTTCGTGGC GTTGCTGAGT TTCTTCTTCT GAGGGGATAG AATGGCGAAA GAGTGGTATA 7380 

TTCTGCACAC ATTCTCGGGT CGCGAGGCAA GGGTGGAGCG GGCTGTCCGT ATGCTCGTGG 7440 

AGCATGCGAG GATTCCAACG AACGTTATCT TTGATATAAA AATCCC TG AG GAACTGCTTA 7500 

CCGAGGTGAA AGATGGTAAG AAGAGGGTGG TTAGGCGTAA GTTTTTCCCT GGTTACTTGT 7 560 

TGGTGGAAAT GGATTTGCCC GAGGTTGACT GGAGGATAGT GTGTAACGAG GTGCGCAGGA 7 620 

TTCCTGGTGT TTCCGGTTTT TTGGGTTCTT CGGGCAATGC GAACCTCAGG CGGTTTCTGC 7 680 

GGATGAAGCT CGGCGTATTT TGCAGAAGGC GGGGGAAATT AAGGGGGATA GGACTCCTCG 7740 

TATCGCTCAG ACTTTTTTGG TTGGACAACA GGTGAGGATC GTTGAGGGGC CGTTTGCTAC 7 800 

TTTCTCGGGT GAGGTGGAGG AGGTGATGAG TGAACGCAAC AAGGTGCGTG TGGCAGTCAC 7860 

CATCTTTGGC CGCGCTACTC CTGTGGAGTT GGAGCTAGTC CAGGTGGAGG CGCTCTGATT 7920 

TTCTTCTTCC AGGG TGG AG A GTGTTGCAAT GCGCATGATT GCCTGCCGCT TACGCGTTGG 7 980 

TTTCGGGTGT TTTGTTGTTT TTTACGTCAT AAGGAGAGGC CAGTATGGCA GCGAAGAAGA 8040 

AAGTGGTTAC TCAGATAAAG CTGCAGTGTC CTGCAGGCAA GGCGACGCCC GCGCCGCCGG 8100 

TTGGGCCTGC GCTTGGGCCG CACGGGGTTA GTGCCCCGCA GTTTGTGCAG CAGTTTAATG 8160 

ACCGTACTAA ATCCATGGAG CCTGGGTTGG TGGTGCCAGT GGTTGTCACC GTCTATTCTG 8220 

ACAAGAGTTT TTCGTTTGTG CTGAAAACGC CGCCTGCGGC TGTTCTTATT AGGAAGGCGT 8280 

GTGGGATCGA AAAAGGATCG ACGAATTCTG TTAAGCAGAA GGTTGCGCGC TTGTCGCTGG 8340 

CGCAGTTAAC GGAGATTGCT CAAGTGAAAT TACCTGATAT GAGCGCTTTA ACTCTCGATG 8400 

CTGCGAAgcG TAnTCATCGC GGGTACGGCA CGCAGCATGG GGGTGGAGGT AGAGCGTTCA 8460 

TTATGAAGAG GGGGAAGAAG TATCGCGCTG CCGTTGCGCG TTATGATCGC GCCGAGCGGT 8520 

TCAGTCTTGA CCGTGCGGTA GGTTTGCTTA AGGAAGTGAG GTATGCTTCC TTTGACGAGA 8580 

GGGTGGAGGT G C AC G TT AG T CTGAGGCTTA AGAAGAATCA GACGGTGAGG GATACGGTTG 8640 

TGCTCCCCCA CCGTTTTCGG GCCGAGGTTC GTGTGCTCGT TTTTTGTAAA GAGGATCGTG 8700 

TTTCGGAAGC GCTTGCTGCA GGTGCTGCCT ATGCAGGCGG TGCTGAATAT CTTGAGAAGG 8760 
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TAAAAGGAGG CTGGTTTGAC TTCGACGTGG TCGTTGCTAG TCCTGACATG ATGAAGGACG 8820 

TCGGTCGTCT TGGTATGGTG TTAGGTCGCA GAGGGCTGAT GCCTAACCCG AGGACTGGCA 8880 

CGGTCAGTGC GGACTTGGGG GCTGCTGTCT GTGAGTTGAA AAAGGGGCGT GTCGAGTTTC 8940 

GCGCGGATAA GACAGGTGTG GTC CATC TAG CAGTAGGGAA AACGACGATG GACTCTGCGC 9000 

AGATTGTAGA GAATGTTGAC GTGTTTCTGT CGGAGATGGA TCGCAAGAAG CCCGTTGACG 9060 

TAAAAGCTGG TTTTGTCCGT TCGATTTCGC TCAGCTCCAG TATGGGGCCT GGGATTTGGG 9120 

TTGTCCATAA GTCAGAGGAG TAGTATGGCA GTACGCGCAC GAAGGCTGCA GCCGGCAAAG 9180 

GTGGCTGCTG TCGAGAGCCT TACGCGTGAT TTGGGTGAGG CTTCTTCTTA TATCTTTACG 9240 

GAGTATCGAG GGCTTACGGT TGAGCAGCTG AnCCgcGTTG CGsCsCGCct GCGCGAATTC 9300 

TCGTGCGTGT ATCGGGTGGT GCGTAACAAT TTTGCGAATA TCGCCTTTAC GTCCCTAAAC 9360 

ATGACGGTGG GAGAGTATCT GGTGGGGCCC ACGGCCATCG CCCTAGTGGA CACGGAGCAT 9420 

GCGAATGGCG TCGCGCGTGT GCTGTTCGAT TTTGCAAAGG AAGTGCCTGC CTTAGTGGTG 94 80 

AAGGGTGCAA TTCTTGATGG GGAGGTGTTT GACGCTTCGA AGGTAGAAGC GTATTCGAAG 9540 

CTTCCTGGAA AGAAAGAGCT CGTTTCCATG TTCTTGTCCG CGCTGAATGC aACGACGGTG 9600 

AAGTTCGTAC GC g T ATT AC A GGCTGTGATG GACAAAAGGG ATGAgGGTGT AGAAgTTTCC 9660 

GTGGTGTCGG GAgGTGATTC GTCCtAgGCg GTTGTTGTAA CTTAGTTACG GGGTATGTGT 9720 

TaGGCcGGTc AGGCTTCTGG GGTGCTGTCT TCCTGTCCGT TTATAGGGGT TATTTCGCAT 9780 

ACAAGGAGAA GATAATATGG CGGCGTTGAG TAATGAACAG ATTATTGAgG CGATTCgGGG 9840 

CAAGACCATC CTGGAGCTTT CTGAGCTTAT CAAGGCGGTG GAGGAGGAGT TTGGAGTTAC 9 900 

CGCGGCTGTG CCgGTAGCGC CGGTAGCGGA AGGTGGCGGG GCaGGTTCTG TAGCCGCTGA 9960 

GGAGCaGACA GAGTTTACTG TTGTGCTTAA AGGACTTGCA GAACCAGGCa AAAAAATCGC 10020 

GGTTATTAAA GAGGTGCGCA ACGTTATCTC AGGGCTTGGC TTAAAAGAGG CGAAGGATCT 10080 

GGTGGAGGGT GCGCCAAAGA CTTTGAAAGA AAATGTATCC AAGGAAGAGG CGGCAAAGAT 10140 

AAAAGAGTCA ATGACCGCAG CGGGTGCGCT CATTGAGATT TCCTAGTGTC TGGTTTTTTT 10200 

TGCATGCGTC CGGCGCGTCG TTGTGTGCCT CTGACACCCT TTCGTGTGGG AGGGCGTCGC 10260 

GCTTTTGAGT AGAGCGTGGG CTTCTATTTC TTTTCATACT TGTTCTCGGC ATTTTGGCAT 10320 

GCGGGTTGGG TCGCGTTCTC CTCACTTGAG TGGAGGGGAC GGCGTCTCCC CTGTGTGGGG 103 80 

AGTATTACGG TAGAGCGTGT GGTATAGGGA GCACCGTGTC GGTTCGGTGC AGCTTGAGGG 10440 

GGGAGTGCAT GTCAGCACGA GTTTGCAAAA CACACAGAGT GTACGTGGGA AGGGATGTCA 10500 
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GGAATTTTAT GGACATCCCG GATCTCATCG AAATCCAGCT TCGATCTTAC GACACc TTTC 
TGCATGGGGC CCGGAATACA CCGTCCGGCG CCGACACCCT TATCTCCGGT ACTAGAGAGG 
AGCTCGGCCT CGAAGACGTG TTCAAGACTA CCTTTCCTAT CGAGAGCTCT ACGGGGGACA 
TGACGC TCGA GTACCAATCA TACTCCCTTG ATGAGAAAAA CATCAAGTTC TCCGAGGCGG 
AGTGTAAACA AAAGGGTTTG ACGTACGCCA TTCCGCTGAA GGCGCTTGTT GATTTACGTT 
TCAATAATAC GGGGGAGATT AGGCGCAAAG ACATTTATAT GGGAGATATC CCCAAGATGA 
CTGAACGCGG CACCTTTATC ATCAACGGTG CGGAgcGTGT GGTGGTATCC CAGATCCATC 
GTTCCCCTGG TGTTGTCTTT TCTCATGAGA AGGACAAGGA AGGACGGGAG GTATTCTCCA 
GCCGCATTAT TCCGTACCGG GGAAGCTGGC TTGAATTTGA AATTGATCAG AAAAAAGATC 
TCATCTATGC AAAGCTTGAT AAAAAGAGAC GTATCCTAGG CACCGTGTTT TTGCGTGCGT 
TGC AC TACG A AACGCGTGAG CAGATCATCG AGGCCTTTTA CGCCATAGAA AAGACGCCTG 
TTTGTCAGGA TCGTGCGGAG TACGAGC TGC TCACAGgTAA GATCCTAGCA CGATCGGTGA 
CGGTGGAAAA TGAGCAGGgT GAAACCGGGT GTTGTACAAA GC AGG AG AG A AAATCCATCC 
CCATGTCATC GATGATCTGC TGCAAAACGG CATATGTGAG GTCTACATTA TTAACCTTGA 
AGCGGAAGGT TCGTTGCGTT CTGCGGTCGT TATCAATTGT CTTGAACGAG AGGAAATGAA 
GTTCTCTAAG TCGGGTGCAC AGGACGAgCT TTCGCGTGAA GAGGCACTGT GTATTGTATA 
CTCAGCGCTA AGACCAAGCG ATCCTATGAC CATGGACGCG GCGGAAAAAG ATTTGCAGAC 
AATGTTTTTC TCCCCACGTC GCTATGATTT AGGGCGGGTG GGGCGCTACA AGCTGAACAA 
GAAATTTCGC TCTGACTCGC CGACTACTGA GTGCACGCTC ACCCTCGATG ATATCG TAAA 
TACCATGAAA TTTCTCATCA GAATGTATAG CGGTGATGCA CAGGAAGATG ATATCGATCA 
CCTGGGCAAC CGTCGTATTC GTTCGGTGGG GGAATTAATG ACCAATACGT TAAAAACGGC 
CTTTTTGCGC ATGGAACGTA TTGCGAAGGA GCGTATGAGT TCTAAGGAAA CGGAAACGAT 
CAAGCCGCAG GATCTCATTT CCATAAAACC TATCATGGCT GCGATTAAGG AGTTC TTTGG 
TGCAAGTCAG C TTTC TC AG T TCATGGATCA GGTCAATCCG CTGGCGGAGT TGACACACAA 
GCGGCGTTTG AACGCACTTG GTCCTGGTGG ACTTTCAAGG GAGCGTGCTG GGTTTGAGGT 
ACGCGATGTG CACTACACGC ACTACGGTCG GATGTGTCCC ATTGAGACCC CCGAAGGACC 
AAATATCGGT TTAATTGTTT CTATGGCCAA TTACGCACGC GTTAACGGGT ATGGGTTCTT 
GGAGGTGCCG TATGTACGGG TGCGTGACGG AGTTGTTACG AAAGAGATTG AGTAC CTGGA 
TGCTATGGAC GAGGATCGCT ACTACATTGG GCAGGATTCT ACGGCGGTAG GACCGGACGG 
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GGTCATCCGT GTAGATCATG TCTCTTGTCG GCACCGGGGG GATTACAGTA CGCGTAGTCC 
TAaGGATATC CAGTATATGG ATGTTTCCCC CAAGCAGATA ATTTCTGTTT CTGCTTC TCT 
CAT AC C G TTT CTTGAGCATG ATGATGCTAA CCGTGCGTTA ATGGGGTCGA ACATGCAACG 
GCAGGGAGTG CCGCTTATTT TTCCTGAACC CCCGCGCGTG GGTACAGGCA TGGAAGAGAA 
GTGTGCATAT GACTCTGGAG TGCTGGTGAA GGCAAAGCAA GACGGAACGG TTGCCTACGT 
TTCCTCAGAG AAGATAGTGG TTTGTTCCGC CGCGGCGTCT GGGGAAGAGC AGGAGGTCGT 
GTATCCGTTA CTTAAGTATC AGCGGACAAA TCAGGATACC TGTTACCACC AGCGGCCAAT 
AGTGCACGTG GGAGATCGGG TACAGGTAGG AGATGCGCTT GCAGACGGTC CTGCAACGTA 
TCGAGGGGAG CTTGCGCTTG GCAGAAACAT TCTAGTTGGT TTTGTGCCGT GGAACGnTTA 
CAACTACGAG GATGCCATTT TGATTTCTCA CCGGGTGGTA AAGGAGGATA TGTTCACCTC 
GGTTCACATC AAAGAATTTT CTACTGAGGT GCGTGAAACC AAGC TGGGTT CTGAACGAAT 
GACGAATGAT ATCCCGAATA AGTCTGAGAA GAATCTGGAT AATTTGGATG CAGAGGGGAT 
CATTCGTATT GGGTCAAAGG TGCGTGCGGG AGACGTGCTT ATCGGAAAGA TTACGCGAAA 
AAGCGAGTCT GAGACGACGC CAGAGTTTAG GCTGCTGAAT TCTATTTTTG GGGAGAAGGC 
GAAGGAAGTG CGTGATTCTT CTC TACGTGT GCCGCATGGA GTTGAGGGTA CAGTCATTGA 
CGTGCAGCGA CTCAGGCGTT CGGAGGGAGA TGATTTAAAC CCCGGGGTGT CAGAGGTGGT 
GAAGGTTCTT ATCGCTACCA AGCGTAACTG C GTGAAGGGG ATAAAATGGC CGGTCGCCAC 
GGTAACAAGG GTATCGTTGC GCGCATCCTT C CTG AAGAAG ACATGCCGTA TCTGGATGAT 
GGTACCCCGC TTGATGTCTG TTTGAACCCG CTCGGTGTAC CTTCTCGTAT GAACATAGGA 
CAGATTCTTG AATCTGAATT GGGACTTGCG GGGTTGCGGC TTGACGAATG GTATGAGTCT 
CCTGTCTTTC AATCTCCAAG CAACGAGCAG ATTGGGGAAA AGTTGATGCA GGCAGGTTTT 
CCGACTAATT CAAAAGTGAT GCTGCGTGAC GGACGCACGG GGGATTATTT TCAAAACCCT 
GTATTTGTGG GGGTTATTTA CTTTATGAAG CTTGCGCATC TAGTGGATGA CAAAATGCAC 
GCCCGCTCTA CAGGTCCATA TTCGCTTGTG ACGCAGCAAC CCTTAGGGGG TAAAGCGCAG 
TTTGGAGGGC a gCGTCTCGG GGAAATGGAG GTGTGGGCGC TTGAArCcTA CGGCGCGGCG 
AATACCCTGC AGGAGTTGCT AACGATTAAA TCGGATGATA TGCACGGGCG TTCTAAAATT 
TATGAGGCAA TTGTAAAAGG GGAGGCTTCG TCTCCTACCG GTATTCCTGA ATCTTTTAAC 
GTGTTGGTGC AGGAGCTGCG GGGACTTGCG CTCGACTTTA CGATTTACGA TGCGAAGGGC 
AAGCAGATTC CGCTCACTGA GCGCGATGAA GAAATGACGA ATAAGATTGG CTCTAAATTT 
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TAAGGGGTGC aGGGAATGAA GGATATCCGG GATTTTGACA GTTTACAGAT 
TCCCCTGATA CCATTCGGGC ATGGTCCTAT GGAGAGGTGA AAAAGCCTGA 
TACCGCACGT TGCGTCCTGA ACGTGAAGGG CTTTTTTGTG AACGCATTTT 
AAGGAATGGG AATGCTTTTG TGGAAAGTTT AAGTCAATTC GGTACCGGGG 
GATCGGTGCG GGGTGGAGGT AACGCATTTC AAGGTTCGCA GGGAGCGCAT 
GAGCTTGCAA CGCCTGTTTC TCATATTTGG TACTACCGTT GTGTACCAAG 
TTGTTACTCG ATCTACAGGT GAnTCgCAtG CGTTCTGTTT TGT AC TATGA 
GTTATAGAGC CgGGCGACAC CGATTTAAAA AAGAATCAGT TGCTCACTGA 
AATG a CGCGC AGGAGCGCTA CGGTGGCGGC TTTACGGCGG GAATGGGAGC 
CGTACCCTTT TGCAAAACCT TGACCTTGAC GCGCTTGTTG CACAGTTGCG 
ATGGAGAAGG GTGCGAAAAG CGACAAACGC TTGCTGCGTC GCATAGAGAT 
TTTCGGGTGT CGGGAAATAA GCCGGAATGG ATGATTTTGA GCGTTATCCC 
CCTGATTTGC GTCCTATGGT GCAGCTCGAC GGAGGGCGTT TTGCTACCTC 
GACCTGTATC GGCGTGTGAT CCACCGCAAT AGCCGTTTGA TTCGG CTC AT 
GCGCCGGATA TCATCATTCG GAACGAAAAG CGCATGTTGC AAGAGGCAGT 
TTTGATAATT CTAAGCGCAA gCCCGCGATT AAAGGTGCGT CAAACCGGCC 
ATTTCTGACA TGCTCAAGGG GAAGCAAGGG CGTTTTCGCC AGAATCTTTT 
GTCGACTATT CCGGGCGTTC GGTTATCGTA GTGGGGCCTG AACTTAAGTT 
GGGTTGCCTA CAAAAATGGC GCTTGAGCTG TTT AAGCCC T TTATTATGAA 
GAGAAAGAAA TTGTCTCGAA CATCAAAAAG GCAAAGATGC TCGTGGAACA 
AAGtATTTTC GGTGTTGGAT GAAGTGGTAA AAGAGCATCC AGTTATGCTT 
CGACATTGCA TCGATTGGGC ATTCAGGCTT TTGAGCCGGT GTTGGTGGAG 
TTCGTCTTCA TCCGCTTGTG TGTAAACCTT TTAATGCTGA TTTTGATGGG 
CGGTGCATGT GCCGCTGACG CAGGCGGCAC AGATGGAGTG TTGGACGCTC 
ATCGCAATTT GCTTGACCCT GCAAATGGGC GCACGATTGT GTATCCATCT 
TTCTGGGTTT GTATTATCTG ACAAAGGAAC GCTCTCTGCC GGAGGTGCTC 
TTTTTCCTCG GTGGAGGAGG TAATGATGGC TGCGGAAAAG GGGGTAATCG 
TCAGATTCAA GTGCGATATC ACAAATGTGA TGGTCAGCTT GTGGTCACTA 
ACTTGTGTTG AATGAGGAAG TTCCCGCAGA GATTCCTTTT GTCAACGAAA 



PCT 

AAAGCTTGCC 

GACAATTAAT 

TGGTACTACA 

TGTTATCTGC 

GGGGCATATT 

TAGAATGGGT 

GAAGTACATA 

AACTGAGTAC 

GGAGGCTATC 

TGAGAAGATG 

CGTAGAAAAC 

GGTGATCCCG 

AGATCTCAAT 

GGAACTGAAG 

GGACGCGCTT 

GCTTAAGTCT 

GGGCAAGCGG 

GTGGCAGTGC 

AAAGCTGGTT 

AGAGTCGCCG 

AATCGGGCGC 

GGGAAGGCGA 

GATCAAATGG 

ATGTTGTCGA 

CAGGACATGG 

GTCCTCGCCG 

GCTGGCAGGA 

CCGCAGGAAG 

CGCTTGATGA 
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PCT/uiB 


R3041 


CAAACGCATC 


AGGAAATTAA 


TTGAGCGGGT 


GTTCAAGCGT 


CAGGATTCTT < 


GGCTTGCGGT 


15780 


GCAGATGCTC 


GATGCACTGA 


AAACTATCGG 


TTATACCTAC 


GCGACCTTCT 


TTGGTGCAAC 


15840 


GCTCAGTATG 


GACGACATCA 


TCGTGCCTGA 


GCAGAAGGTG 


CAGATGCTCG 


AAAAGGCGAA 


15900 


CAAGGAAGTG 


CTAGCGATTG 


CGAGTCAATA 


CCGCGGGGGG 


CACATCACGC 


AAGAGGAGCG 


15960 


TTATAATCGC 


GTCGTTGAGG 


TGTGGTCTAA 


AACAAGTGAG 


GAGCTC AC TT 


CGCTCATGAT 


16020 


GGAAACACTT 


GAGCGCGACA 


AGGATGGATT 


TAATACCATT 


TACATGATGG 


CTACCTCAGG 


16080 


TGCGCGCGGG AGTCGCAATC AAATCgCCAA 


CTGGCGGGAA TGCGTGGCTT AATGGCAAAG 


16140 


CCGAGTGGGG 


ATATCATCGA 


ATTGCCTATT 


CGTTCTAATT 


TTAAAGAGGG 


ACTCAATGTC 


16200 


ATTGAGTTTT 


TTATTTCTAC CAACGGTGCA CGCAAAGGGC 


TCGCAGACAc 


TGCGC TAAAG 


16260 


AcCGCTGATG 


CGGGGTATTT 


GACACGTCGT 


CTGGTTGATA 


TCGCGCAAGA 


TGTGGTGGTG 


16320 


AACGAGGAGG 


ACTGTGGTAC 


CATCAATGGC 


ATTGAATATC 


GCGCGGTGAA 


GTCCGGCGAT 


16380 


GAGATTATTG 


AATCGCTTGC 


TGAGCGCATC 


GTAGGAAAGT 


ATACACTTGA 


ACGTGTAGAA 


16440 


CACCCCATCA 


CCCATGAACT 


GCTGCTCGAT 


GTGAACGAAT 


ACATCGACGA 


TGAGCG TGC A 


16500 


GAAAAGGTGG 


AAGAAGCGGG 


CGTGGAGTCA 


GTGAAGTTGC 


GCACCGTGCT 


CACGTGCGAA 


16560 


TCTAAGCGAG 


GAGTGTGTGT 


GTGCTGCTAC 


GGGCGGAATC 


TTGCACGCAA 


CAAAATTGTA 


16620 


GAAATTGGGG 


AGGCGGTTGG 


GATTGTAGCC 


GCTCAGTCCA 


TTGGTCAGCC 


GGGTACGCAG 


16680 


CTGACAATGC 


GCACGTTCCA 


TGTTGGGGGT 


ACGGCAAGCA GTACTACGGA AGAGAACCGC 


16740 


ATCACGTTTA 


AGTATCCCAT 


ACTGGTAAAG 


AGTATTGAGG 


GGGTGCATGT 


GAAAATGGAG 


16800 


GATGGCTCTC 


AGCTGTTCAC 


GCGTCGGGGG 


ACGCTCTTTT 


TTCACAAAAC 


TCTGGCAGAG 


16860 


TATCAGCTTC 


AAGAGGGTGA 


CAGCGTGCAG 


GTGCGTGACC 


GCGCGCGGGT 


GCTAAAGGAT 


16920 


GAGGTTCTCT 


ACCACACCAC 


CGATGGGCAG 


ACGGTGTACG 


CTTCGGTGAG 


TGGTTTTGCG 


16980 


CGTATAATCG 


ATCGAACCGT 


GTACCTGGTA 


GGGCCTGAGC 


AAAAGACGGA 


AATTCGCAAT 


. 17040 


GGTTCTAATG 


TAGTAATCAA 


GGCAGACGAG 


TATGTGCCGC 


CCGGAAAGAC 


CGTGGCTACG 


17100 


TTTGATCCGT 


TCACTGAACC 


TATTTTGGCA 


GAGCAGGATG 


GCTTTGTGCG 


GTACGAAGAT 


17160 


ATTATTTTGG 


GCTCTACGCT 


CATCGAAGAG 


GTAAATACTG 


AAACGGGGAT 


GGTGGAGCGC 


17220 


AGGATTACGA 


CGTTGAAAAC 


AGGAATACAG 


CTTCAACCGC 


GGGTATTCAT 


CTCTGATGAG 


17280 


TCGGGGAATG 


CGCTGGGTTC 


GTACTACTTG 


CCAGAGGAAG 


CGCGCTTGAT 


GGTTGAAGAA 


17340 


GGCGCGCaGG 


TGAAGGCGGG 


TACGGTCATT 


GTAAAACTGG 


CAAAAGCAAT 


TCAAAAGACA 


17400 


TCGGATATTA 


CGGGGGGGCT 


GCCGCGTGTT 


TCTGAATTAT 


TTGAAGCGCG 


GCGCCCTAAG 


17460 
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pct/uSH 


!n304i 


AATGCGGCTG 


TCTTGGCACA 


GATTTCTGGG 


GTTGTGTCGT 


TCAAAGGACT 


GTTTAAGGGT 


17520 


AAGCGTATTG 


TCGTGGTGCG 


TGACCATTAC 


GGGAAGGAAT 


ATAAGCACCT 


CGTGTCCATG 


17580 


TCGCGTCAgC 


TTTTAGTACG 


TGATGGAGAT 


ACGGTTGAGG 


CAGGCGAACG 


C TTGTGTG AT 


17640 


GGTTGCTTTG 


ATCCCCATGA 


TATCC TGGC A ATTCTGGGTG 


AAAATGCTTT 


GCAAAACTAT 


17700 


TTGATGAATG 


AGATCCGTGA 


CGTGTATCGT 


GTGCaGGGTG 


TTTCAATCAA 


TGACCAGCAC 


17760 


ATTGGTTTAG 


TGGTGCGGCA 


AATGCTACGA 


AAGACAGAGG 


TTGTCTCGGT 


TGGGGACACG 


17820 


CGTTTTATCT 


ACGGGCAACA 


GGTGGATAAG 


TACCGTTTTC 


ACGAAGAGAA 


CCGTCGGGTT 


17880 


GAAGCGGAAG 


GGGGGCAGCt 


GCGGTTGCGC 


GCCCAATGTT 


CCAGGGTATA 


ACGAAGG CGG 


17940 


CGTTGAACAT 


AGACTCTTTC 


AT ATC TGCGG 


CATCTTTCCA 


AGAAACGAAC 


AAGGTGCTCA 


18000 


CCAATGCGGC 


GATTGcAGGC 


TCTGTTGATG 


ACTTGTGTGG 


GTTGAAGGAG 


AACGTCATTA 


18060 


TAGGGCACTT 


AATTCCCGCA 


GGTAcGGGGA 


TGCGG C G TT A 


TCGTCAGGTG 


AAGCTGTTTG 


18120 


ACAAGAACAA 


GCGGGATCTT 


GATGTGCaGA 


TGGAGGAAGT 


TATCAGGCGT 


AGAAAACTTG 


18180 


AAGAGGAGGC 


GCTTGCCCAG 


GCAGTTGCGG 


GTATGGAAGG 


GGAACCTGAA 


GGCGAAGCGT 


18240 


GATGGATTGA 


CCTGGTTTGG 


CTATTCTGAG 


TATCCTAGTC 


CGCGTGTGCT 


GTGTGCGGCA 


18300 


AGGTTTACGG 


TGTTGAGGAT 


TTTTTTGGGG 


AAGTGAGCGA 


AAAGAATGCC 


GACAATTAAT 


18360 


CAATTGACGA 


GGATAGGGCG 


TAAGGCGGTT 


TTTTCTCGTA 


CGAAGAGCCC 


TGCGTTGcAG 


18420 


GCTTGTCCgC 


AGAAGCGCGG 


AGTGTGTACG 


CGTGTGATGA 


CAGTTACGCC 


AAAAAAGCCG 


18480 


AATTCTGCTC 


TGCGTAAGGT 


GGCGCGTGTG 


CGTCTAAGTA 


GCGGGGTTGA 


AGTGACGGCG 


18540 


TAC ATTCCCG 


GGATTGGGCA 


TAATTTGCAG 


GAGCACTCGA 


TTGTGCTGAT 


TCGCGGTGGA 


18600 


CGTGTGAAAG 


ATTTACCTGG 


AGTACGTTAT 


CATATTATCC 


GGGGGGCCAA 


GGACACTCTT 


18660 


GGCGTGGTGG 


ATCGTAAGCG 


CGGTCGTTCA 


AAGTACGGGG 


CTAAGCGCCC 


TCGCGCGTAG 


18720 


GGGCTGGGGA 


GAGGAGTTGG 


TATGGGGCGG 


AAGCGACGGG 


TGTCGCGTCG 


GGTACCGCCG 


18780 


CCTGACGCGC 


GGTATAACAG 


TGTGGTGTTG 


GCGAAtTTAT 


TTGTCGAATG 


ATGCTGGCGG 


18840 


GTAAGAAGGC 


AACTGCGGTG 


GGTATTATGT 


ACGATTGTCT 


TGAACGTATT 


CAGCAAAGGA 


18900 


CTGGTGAGGA 


GCCTCTTCCG 


GTGTTCACAA 


AAGCGTTAGA 


GAACGTAAAG 


CCTGCAGTGG 


18960 


AGGTTAAATC 


GCGGCGGGTT 


GGTGGTTCTA 


CCTATCAGGT 


GCCGATGGAA 


ATTCGGGAAA 


19020 


CGAGGCGTGA 


GGCTTTAGGT 


ATGCGCTGGA 


TTATCGGTGC 


AGCACGCAGG 


CGCACGGGAC 


19080 


GTGGCATGTC 


GGAGCGACTT 


GCAGCAGAGA 


TCCTTGATGC 


GTACCACAGC 


ACGGGAACTG 


19140 


CCTTTAAACG 


TAAAGAGGAT 


ACGCACCGCA 


TGGCAGAGGC 


CAATAAGGCT 


TTTTCGCACT 


19200 
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ATCGCTGGTA GATACGCGTC TCTTCCTGGG GCGTTTGTTG CAGGGGCGGT GTCTGCCCTT 

GGCAGGGGTG TTTTTGCCCT CGTCCTTTCT CTTGATTCAT CTGGACGTCG GTTTTGGGTG 

GCGTGCTCTT GTGCGCCTTA TC AG CAT AAA CGGAGGGTCC ATACGGTGGG GGGGCTACTC 

TCGGATCCAC ATAATTTTGC GCGCGCGTGT GCCCTCTTTC GTGGAATTTT CCGCAAGGGA 

AGAGCGCTCG GGGGTGGTTC GCGCAAAGCT TCAAGTGCCC TGT 

(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 4724 base pairs 
J (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 




13041 

19260 
19320 
19380 
19440 
19483 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 

CCTTTTTTCG ATCTGTCCAA TATGAGTGGT TGGACGAGCG GACATTTTGT GGAAATGGAA 60 

TCCGCTCTGT CTGAGTATAA AAAGTCAAAA AAnCCGCTCT ACGTTTTTTC T AC C TC TT AC 120 

AGTTTGGCTG ACTATTACAT CGCCTCTTTT GCTGATGAAA TTATCCTTGA TCCGATGGGG 180 

TCTGTGGATC TGTCGGGCTT TTACACGGAA ACTCTCTTTT ACGGAGGTAT GGAGGAAAAG 240 

ATTGGGGTGC GTTGGAACGT CGTGCATGCT GGGGTGTAnA AGGGCATGGC TGAGATCTTT 300 

TCTAGGAAGG ATTTTTCTCC TGAGGTTCGC AGAAATTATC AG TC TGT ATT TGCGCGTCTG 3 60 

TGGCAGCAGT ATCTCAGTGA TGTTTCGCGT AATCGAGCAC TAGAGGTGCA GCATCTTGCC 420 

CGTTACGCGG ATCGTCGCCT TGAGCTCCTG CAGAAGTATA AC GG AG ACGG TGCGCGCACC 480 

GCATTGGCGG AAAAGTTAGT AACGCGCGTA TGTTCCTACG ATGAAGCTGG CGTTGCGCTC 540 

AAATTTTTAA AAGAAGACGA CTACGAATCT GCAAAAAATT TCGTTGGTCT AGACGATTAT 600 

AATCGTGACC GTGCACAGCG GCAGGTGCAG GATCAGGTGG GGATTATTCA TCTTGCAGGA 660 

CCGATTGCTG CACACAGGGA T ACGG AAC TC GGCGGAACGA TCAGCGACGA GGTTAGTGCT 720 

TTGTTGGATG TCGCGATGAG TGATCCGGAT ATTAAGGCAG TAGTGTTGCG TATTGATTCC 7 80 

GGTGGGGGAG AGGTGTTTGC TTCTGAACGT ATCCGCCGCG CGCTTGCGnG GGCAAAGCGT 840 

CGAGGCAAGA AGCCAGTGAT AGTATCGATG GGTGCGATTG CTGCGTCTGG TGCGTACTGG 900 

GTTGCTTCTG CAGCCGATTA CATCTTCGCA TCCCCCTATA C CATC AC TGG TTCCATAGGG 960 

GTGCTTTCGG TACTACCGAC ATTCGAAACG TTTTTAGAGC GATATGCGGG GATCACTGTC 1020 

GATAGCGTAC AGGTGCACGG CGTTCGCCAA CCTTCTTTGC TCAGGAGTGG AACGGCTGAA 1080 
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GACACCGCGC GCATGCAGCT TGATGTGATG GCGACGTATC GTACTTTTCT TTCGGTTGTT 
TCTGCCGGGC GTAACCTTAC CCTTGATCGG GTGGCGGCGG TTGCAGAGGG TAGGATTTAC 
GCGGGGGAGG ACGCAtTTCC CTAGGCTTGG TTGATGCGCT AGGCGGACTA GATGAAGCGG 
TAG C AC ATGC AGCGAAAGAA TCACATTGCA GGCAGTATTC GGTG AG AG TT TTGAAGCGGA 
CCsCACGTAC GGTGAAGAAT TTCTGCAGTC CCTGTGGGAT GTCCTGCAGA AACGAAtCTT 
GCTTTTGGAG AGCGTGTGAT CATTGGAGAG TTACTCCAGC TTGACCTAAG CAAGGGCACC 
TACGTATATG AGCCGCTGCG CTTGCATTGG CGTTGACGGG CACTGCTACG CTTGATCGAG 
CGCACGTnGT TTGCTACGGT TGGCGCCGGT TTTTGGGGAT GTAGCTCAGT TGGTTAGAGC 
GCTTGCATGG CATGCaAGAG GTCAGGGGTT CGATTCCCCT CATCTCCATC GCCGTGTGTG 
AGGGAGGGGG TGTGTCTGAT TTAGGTTTAG ATCCGGATCT GTTAGCTCTG CTGCAAGATA 
CGCCGCAGGt GTGCCGTCTG AGCATTCTTC TGCAGGGAAG GGTACAGCGA TGTCGCCTAc 
CGGGACGCGA GATCCGAGTG ACGTTGATCT TTCTGAGCGT AgTTTTCCCT TGGTTACTGA 
GTTTCAAAGC AAGACCCCGC ACCAGTTTTT TGAGTCAGCA GAGTTTTATA AACGTGTCGT 
TTCGGATGAG TTGGAAGTTG GGCAGCGTGC GCATGCGGCT TTGGCGCGCT ATTTGTCCAC 
CACTGACTTA AAGGATCGCT CTGTGTGCCG GCAGCAGCTT ATTAGCAGTT ACTGGCAATT 
AATGGCACAG ATATCGGGGA AAATCGGCGG TGGGTCGGCG TGCATGGAAA AGCGTTACGC 
ATTGCGCTAT GGACTGTTGC TTCCTACCTT GTTGACCGCA TCCCAGAAAG ATATCTTCGC 
GCGGATTATT GAGACGAATA GTTTGCAGCA GC CTCTTT AT TATCTGGATG AATGGCTGAT 
TGCGATTGGT TCTGGAAAGG TTCGCCCTTC AAGCACCGAC GAAgTGCAAG TAAAAAGGAA 
AGACGATGTC GCACGCGTAC GGCAGGCGTA TGATAAAGCG TGCGGGCAGT TGCAGAGTTC 
TGAGCGTCTG TTGCAGGTGA GGTCGGCGGA gcGTGCCCGT GTGGAAGAGG AGGTGAAGAA 
CAGAATTTCG CGTC TTTTCG TGCACGAATC CATTGAAGGT CTCCCTGGGG TGACAGCAGG 
TTTCAACGAG GCGCAGAAGC AAGGAATCTC GGAGATCCAT GAATTGTTAA AAAAGTTGTT 
GGGTATAGAT CGGGAGTTTA ATGGGTTATA TGCGGGCTAC CGCGCTTCAC AAGACGCAgT 
GCATTCCCTG CGAGAGAAAC TAGATGCGCC CAATGCGGAG AACAGTTCAG CAGTGAGTAC 
GGAGTACGAT aCCGTGCGCC AAATGATAAA GATG AG C TGC GGGCGCCAGG GCAACCATTT 
CCCCCTCTTG TCCAGAGAGT ATTTCCGTTC TGCGGAGCAT GAGATTGGCA CGCGGGAAAA 
TGTATTGAAA ATTATGGCTT GGATTGAAGG TCTGGATCCG GAAGCGTATT GCCGTCAGTA 
TAAGCAGCAG GTAAACAGGA TTCCGCCATT CGTGGTGCTG TTGCCTTCTT ATGGGGACAT 
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AGGATTTTGT TGGGAGCCGT TTGATCGTTA CAATCGCGTG ACAAGCCGTG GACGCGTTGC 2 880 

GtGCCTATGT ATGGAAGGAG CTTGAAGCTT GCAGTTATTA CCGCGACGGC GGATTTACGT 2940 

TGGCAGGTTG CAAAGGAAAA GGCTTCGTAT TACTGGATGG AAGAGGGCTT GACGGGGAAT 3000 

TATTATCAGT GGTTTCAACC CCAAAAATTA AGGGGTGATG TAAAGGAGTA TTTTATTGCC 3 060 

GATTACACGA CCTGGCTCCT GAAGGAAAGC GAGGGCATCC AGAAACTGGA CAAAGAGGTC 3120 

CGCAATGTCT TTTGGCGCTA CATCCCCTTT CCCCAAAAAA TCAAAGACGA ACTCAAGACA 3180 

AAGTCCTTTG TGTACCAAGA GCTTTGTCAG AAGGACGCCA ATCGCCAGGT ATCTGACGGC 3240 

TATTGATAGT TTCTCCTGAA TCGGTTGGTG TCCTGTCATG AGGGGATAGC TTGTGCGCCG 3 300 

GTGTCGGGTG TTCGTTGACC GAGAAGGGTC AGGGTGTTTT TnAAGCTtys CTCTCGCGCG 33 60 

ATTGATGGGC AAGTCTACTG CAAGCAGGCG TGCGAGGTAG ATCCCATAGT GAGGATGATC 342 0 

CTCAATCAGT GAGATGAACT TCATCTTTGA TATCTTTACC AGTGTGCCTT CGCCTACTGA 3480 

TACAATCGTT GCAGAGCGCC GGTTG TTG AG CAAGAACGAC ATTTCCCCGA TGAATATGTC 3540 

TGATGGGGTC AGCATGGACA TGAAC TTGTT ATCCACGTAC ACTGCGAATT TCCCCGACGA 3 600 

AATGTAG AAA AGGGAACTGG ATTCTTCGTT CTGGTAACAT ACCACCTGGG CATCCCGGAA 3660 

CGTTAGTACC TGTTGGTTTT TCAAAATTGA AGGCACAAGG TTCGCGACGT TTTCCTGGTT 3720 

GTCAATTTCA AACGTCACCT CGTTGCCCAC GTCGTTATAG CTGAGCCTTT TGACAAAAAT 3780 

TTCTGTCATT TTTATGCCCA TACCGTGTAG ACCAGGTTTG CACGCGCTTG CCATGCGGCT 3 840 

TTTCCAATCA AAGCCTGTGC CTTCGTCACG AATGGTAATG CGTGTACGCT GCAGTGTAAT 3900 

GTCATAGGAA ATATAGATTT TTTTCgCGCT AATGCGCGGG TCCtGcTTGC GCAGAGCAAT 3960 

CAAATCAAAG ATATCCTTGC GCTGTTCGAG CCaCTCTGTT TTTTCGTCGT AGCTAATGCC 402 0 

GCAGTTTCCG TGCTCCAGTG CATTGAGTAA CAGTTCCATC ATTGCGCCTT CAAACGAAGT 4080 

GCGTTCAAGC TCATTGATAC GGTTGGTATT GTACAGGTAC GAACTAATCA AGCTGGCGTA 4140 

AAAGGTAATC TCAAAGGAAT CGGTGTCGCA GATAAAATTT CCCTGCTCGT. GTCCATGTGC 4200 

TTGGTGCACG AGGCTGCGAC TAGAAAGAAA GTGTCGGTTT CTGTCCACGA TTCGCACAAC 4260 

CTGGGAGGCA TGCGCTTCAA ATTCCTGCCG CGTGGAAACT GAAAGGAAAT TCGGGTCTTT 4320 

GCGATTTACG ATTTTTATTT TTTCTTCCAT CGAGTTAGTG ATAGCAATCA CCCCACCAAA 4380 

TAGAAGCCAA GGATCATCCT TTATAATTTT TAAACACGCT TCGCTGTCGA CGTTTGGGTC 4440 

AC CAAAATC A ATAATCTTAA TCTCAGGCAT CTCGAAGCGA AAAACGGATG CTATCTCATT 4500 

CAGACGAGAG AGCGTCTGAA TGTGTATATC CACGCGTTCT CCAGTACACG CACCGTTAAg 4560 
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GCAGATATGG TAGACGTAAC CGTACTGATA AGAGGTATTT g CTC ATACTC ATAATCCTTg 
TTATAGAAAT CGAGCCACGG TAATCATCGG TTGACTTATC ATcGAGAATG AG ATCTGGc T 
ACGCATTAgG TATATCGTGT GGGGGCATGC GCnTGGGAAC AGGC 
(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14822 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
<D) TOPOLOGY: linear 



3041 

4620 
4680 
4724 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 

TAGCC TGCCG TGGCGCACCC CTGCTTTGCT CCACGCCGCG CTCTTTACGT TCCCCGCACA 60 

C ATGCGCT AC ACTCCCCGCC ACCGCCGCAG GcAGGGCCCC GTGTTACAGG ACCTATCCGC 120 

AAATGCC CGT AAGTAC TGCT CGAGCTCGGT CAGACCGTTG CGCGTGAGGC TCGCGCCTCG 180 

CTCCTTCAGC CAGAGCACAT TCTGCTCGCC CTCATTCAGC ACAAAGTAGG CCGCGGCTAC 240 

AAGCTCATCG AAAAACTCAT TGAAGATGTC GCTACCGTCC GCCTCATCCT CGAGCAACAC 300 

GTCCTTACCA ATGAGGGAGA CGTCGCCAGT CCCCAGGACC TGCCCGTCTC AGGACGCGTC 360 

AAACACTTGC TCGACATCGC AGCAATGGAA GCACGCTCCc TGCGGTGCGC TTACATCGGT 420 

ACCGAACACC TCGTTATCGC CTTTGCCCGA GAGGAGCAAA ATCCTCTCTT CCAAAGCCTC 4 80 

ATCCGAGAAG GACTCTCGCT CGATGACCTG CGAAACGCGA GCATTATATC CTCACCTCAT 540 

TCTGATACCA CCCGCACCCG GCTCGAGCGG AAAGTTGCAA GTGTCCTTGA CGAATACGGC 600 

ACCGACCTTA CCGAACGCGC GCGCGCCGGC GCCCTCAATC CGGTCATCGG ACGAAACAAA 660 

GAAATTACCC GCGTCATTCA AATCCTGTGC CGGAGAGGAA AAAATAACCC GGTGCTCATC 720 

GGAGAGCCAG GTGTCGGGAA AACTTCCATC GTTGAGGGGC TCGCGTACGC CATCGTTCGG 780 

GAGGAGGTCC CGCACATCCT GCTGCACACC CGCGTCGTTT CCCTAGACCT TGCCGCCGTC 840 

ATAGCAGGAA CAAAGTACCG CGGCCAGTTT GAGGAGCGGC TCAAACGCAT TATTAAGGAG 900 

GTGGAAGAAA CTGAAAAAGT CATCCTTTTC ATCGATGAGC TGCACACACT CATCGGAGCA 960 

GGAGGCACGC AGGG GTCTTT GGACGCCGCC AACATGCTCA AGCCGGCCCT TGCACGCGGA 1020 

CAAATCCAGT GCATTGGGGC AACAACCCTG GCAGAGTATC GCCGTTACTT TGAAAAAGAC 1080 

GCAGCTCTCA CCCGCCGATT CCGATCGGTG CTCGTGCGTG AACCGAGCTT TGAAGAAACC 1140 

TGCACTATTT TACGCAAAAT AAAATCACAC TACGAACGAC ATCACCAGGT GATATACCAA 1200 
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AGCGATGCGC TTGAAAAAAT TGTTGAGCTT TCACGGCGCT ACATCCCTGA GCGGTTCTTT 
CCAGATAAGG CAATTGaTCT TATGGATGAA GTAGGAGCCA TGAAACGGGT ACAACAGCGC 
GCGGATACGC AGGTATTGCG TTCCTTTTCC ATAAAAGTTG CTAATCTTAC CACAGAGACT 
GAGCGCGCCA TTGCGCTTGA AGATTGGGCG CGCGCGCGTT CCTTACACAC CGATGTGGTG 
CAGCTGcGCA GACGG CTCC A CGCGCTGAAG GTAGAGTGGA GCGCGCGCGA AgyGcgTCTA 
TCTTTGcAGA AGATGTTG C A CAGGCTGTCT CTCTCATGAC CGATATCCCG GTACATTCGC 
TCGAAGGGGA TGAGCTGTGC CGCTTTACCA ATATCGAACG GGATCTTTGT GCCACCGTGC 
GTGGGCAGCG CGAGGCCATT GCAACGCTCG CGCGCGCTAT CGTACGCGCG CGTGTCGGCA 
TCTCTTCAGA CACGCGCCCC ATTGGCTCCT TCCTGTTTCT TGGACCGACC GGTGTAGGCA 
AAACGCTCTT GGCAAAGACA CTCGCGGAAT TTCTTTTCGG TTCAGCAGAC GCGCTCATCC 
GCATTGACAT GAGCGACTAC ATGGAACGCT ACAACACCTC ACGCCTCATG GGAGCACCGC 
CTGGATACGT GGGATTTGAA AATGGCGGTC TACTTACCGA GCGCGTACGG CACCGCCCTT 
TTTCTGTCAT CCTTCTGGAT GAAATTGAAA AGGCGCATCC AGATGTCTTC AATGTTCTCC 
TCCAGGTGTT AGAAGAAGGA GAGCTGCAAG ACAACCTGGG GCACACGGTG AACTTCCGCA 
ACACTATCAT CATCATGACC AGCAATGCAG GCACACGCGG CCTGGGGGAA AACGTTCCTG 
GCTTTCAAAC CGCACGCGCG CGAAACATCG AGTACCGTCA Gc TGCGCGT A CAGGCCcTCC 
GGGAAATAAA ACGCATCTTC TCTCCGGAGT TTCTCAATCG CGTTGACGAG TGCGTAGTGT 
TTGCTCCGCT TGAGCGAGAG ACCCTGCAGG AAATTTTAGA ATGCGAACTG AAGAAGCTCG 
CAGAACGCCT ACGCGGTAAA GATATTGTGC TGCGCTACAG CGCGGCTGCA AAGGCCTACT 
GTCTTGAACA CGGCTTTGAC CCATTCTTGG GCGCACGCCC CtGCGCCGCG TATTGCAGCA 
AGAAATTGAA AATGAGCTTG CGc TGCGCAT GATTCACGGA ACGTTGCGCG CAGGATCGTG 
CGTGCACATA GACTCAGACG GCGCGCGCCT CCACCTTTCT ACCGAAAAAA GTTACCTGAC 
GCTGCATCCC CAAGAAATAT AACTAATCAG TCACACGCGC CCGTATCTCC CGTACCTGCA 
GGTCACTTTC CCACACAGAG CTTCTCAAAC AGCGCATCTA GGATATCTTC GCTGTGCACT 
TCTCCAGTAA GCGCCCCACA ATGATAGAGC GCCTCTTCCA GATCGTGCAC CACTGCATCC 
AACCCGAACC CACGTGCATA CGCCTCCTGT GCATGCTCCA ACGCCTGCAC TGCGGCGTCT 
ACCAATACGT ACTGGCGTTC TGAGCCAAGA GAAAGCTCCT CGTACGGCAC CTGACCGCCG 
TGCAGCAGGT GGAGTGTCTG TGCACGGAGC GCGTCCAACC CCGCGTGAGT CTTTGCGCTT 
ACACACACGA ATGc gCGCGG CGCACGATCC CTCACCTCCC CGTTCTTTCC CCCTGCTAAA 
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CACTGCTCCC CCGCCCCGCG CGCGTCCTGA CTGCGCGCAC ACGACAACAC CGGTGCCGAT 3000 

ATAAACGGCT GCACTGCCTG ACACACCTGT ATGCGCTCAG ACATAGACAT CAAATCGTTG 3060 

TGGGTAACTA CCACTACCAA GGGTACTGCA CAGTCCGAAA GAAAAGCGCA ATCTGCAGCC 3120 

TGCACACCTG CACGTCCATT AATAATGTAA AAAACGCAAT CTGCTCCCTG CAAGAGTTGC 3180 

TCGCTGCGTA CCACTCCCTG TGCCTCAATA GGATTGTCAG TTACTCGTAA GCCTGCCGTA 3240 

TCACACAGAC GC AC TGGAAT GCCCGmTAGA TCAAGGTCTG CTTCAAGCCA ATCGCGCGTT 3300 

GTACCCGGAA CGGACGAAAC GATGGCACGA TCCTGTCCTA AAAGAGCGTT GAAAAGAGAT 3360 

GATTTACCCG CATTTGGACA ACCGCCGAGC ACGATGCGCA CTCCCGTTCG CTGCAGCGCA 3420 

CGCTCCTGCC AGCAGGCACG GAGCCTGCGC AGACGTTCTA CCAACGGTTC AAGTTCACGC 3480 

ATATCGATAT CGTGCACACG CGTTTCTTCA TCTTCCGGAT ACTCAATTTC CCCCTGAAGC 3540 

GTGGCTGAAA ACGCGAGTAA CGCACGGGTA AGCGCTGCTA TCTCCTGCTG CAGCGCACCT 3 600 

GAAAGtGmAA CACCGCTTGC TGcTGCGCCG CACACGTGCG TGCATCAACT AG TG AC TG AA 3660 

TCGCCTCAAT ACGCGTCAAA TCCCTTTTAC CATGAAAGAA TGAACGAAAA CTAAATTCAC 3720 

CTCGCTGGGC GGCACGGAAC CCnTGCGCAA GAC AGAGCCG ATACACAGCC TGTACGGTAC 3780 

GCACGCCCCC ATGACAAATA ATTTCTACCG CATGTTCTCC CGTAAAACTG TGCGGTGCGC 3 840 

GGTACACCAG CAGTACTACC TCATCCACCC GTGTCTTTCC GTCCAAAATC CATCCGTGGA 3900 

GAAACGTATG CGCACGTGCG CGCGTCAGAG CCTGCGCACG AGAAAAAAAG GACGCAACAC 3960 

GCTCAATGGA GCTGCTCCCA CTCGTGCGGA CAATACCTAA CGCGGCAGGA CTGAGCGCCG 4020 

TGGCAATGGC GACGATGTCA TCGTCGAGCG CATACTCATG TGCGCGCATC AGCTACCGTT 4080 

CCCTCACGGC CGTGGGCGCA GGTGCGTGAA AAGGCAAGCC GCCTGCAAGT CCCACACGGA 4140 

GAAAAAGCAG CGGCACCACT CCCGTACTCA GGGAAGCTCC TATACCATAA CGCACAAAAC 4200 

GCAACAGGCG TGCATACTCA GCGGGCGCAC CCGAGAACAG TGCATGAaGC GCAGAaGACA 4260 

CCACACTACA GAACACACAC CCTACGCAAA AGCGCACCAC CTTCTGGGTG AAAGACCCCC 4320 

CAGCAGCATG AAACCCCACG CCGcCTGCAG GTGCCCTGCG CGTAACCCAA CGATCAAAGA 43 80 

TAAAGAGGTG CCCTGCTCCT AACCCGCAGA ATGCACCACA CATCGACACA TCCGCCCCAC 4440 

CGACAGCTAG CAAGAGTGCT ATGCACACCA mGGCAGAAAA GCGCAACCCC GCAAAAtGCG 4500 

TCCTGAGACC TCCTGTATGC GCGTAAAGAA CACAACCCCA CGCCTGAGCG CGCGCCGCAT 4560 

aCCGAGAATC AGCGCGCCAA AAAGTACTGC GTTTAACCAA CCAAGAAGCA CATCAATAGG 4620 

ATAGTGCACG CCCAAATaCA CACGAGAAAG CCCAATGACT CCTACAAATA GCACGCCCGC 4680 
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CACGCCCGTC CATGCAGTGC GCGC CAGCGG GtaCGCTCCT GTCCATAAGA CGACGGGTTC 
TCCCGATCCC CTACCTGCCT ATCGGTTCCA CAACCTGGCG CACAAAAGCA GGGAACAGGA 
GTCTTCGCAG ACGGATAgcT ACGCGCGAGC AACACAAACA AAGCACTCGC CTGTGCATGC 
CCGGAGGtGT AGAAAAACCA TCGTGGAACA CAAGTTTCAC CGACGGGTCA CGCACAAAGG 
GCCGCGGGAC ACGCAACAGC CCCTTCAGGG CGTAATTGAG CCCCTCGCTA CATGCCAATG 
CGTAGGCAAT GGCTAAACCC TTTCGGTACT CTACGCACCA CAGCACCCAG AGCGAACACA 
GGGCGATACC CTTCCCTCCA AAGAAGGTAA AAAGAACAAC CGCGTGTGTT ATCACAGGGT 
GCGCAgCCTG cTGCACCGCG TGTATGACGG ACAAGTTCCA GAATATAAAT TCTTCCATGG 
TGTCCTATCC TCACTTTGAC ACGCGCGTCT GCATCGATCC GCGCTCCGTC CTGTGCGGCA 
GATCCTCAAG CCAATAAACT CTATCCGGAG CGCGCTCCTG CACAAAAATG gCGCTGTAGC 
GCCCTGaATG CTGCACACGT ACGCTGCCAG GAAAATACTG CGTGTGCAAA AACCACACAA 
GATCGCACAT ATCGTCGCGC GAATGgAAAc CGAGTAACGG TAGTAGAGCG TTGCATCGGT 
AAGCAGCGCT CTGCgCTGGa TCTGCTGTCT CATGCGGGTC AACTGCTGCG CGTACATTTC 
CCCGCAATCC CCCATACCAC GCGTGCGCCG CAGTCAGGAG ATTAAGCGCG CGCGCTTCAT 
CCATGTAGTA CAACTCGTAC ACACCTGACA CTGCAGGTAC CGCAGTAGAA ATGCGATACT 
TGTCCACCTG GGTGAGTGCC GACCACGTTA GCACGTACAG AACCTGCTGC GGGkTTCCCT 
CAGGAACCCC AACGGGCGcA GGTCTCAGTT GCTTAGTGAT CAACGGCTCC AAGGACTGCA 
TGTAACGCAA ACTCCGTGAC AATACAAAAC TGGGAAAAgA GAGAAGCACC CGCCAAGACA 
CGCGCAGGCC GAAc GAAGGC GCCGATCAGA AtCGAACTGA TGCATAAAgG TTTTGCAGAC 
CTCTCCCTTA CCAaTTGGGc ACGGCGCCGA GGACCCCTCA GGCTAACAAA AAAAGACGCA 
ATCGTTCAAG GGTAAACCAA CCGATACTCC AGGCACGCTG TGACCTTGCG CAAAGGGGAT 
TACCATGGAA AAACAGTCAC CCGCACAAAC TATCTCGCTC TTCGTGCTCC TCGCGCTCAT 
GTTTGTACTC GTGTGCATGC TGTTCGTACC CTACtAACGG TGCTTCTCTG GTCGAGCATC 
CTTGCTATCC TGCTTTCACC GTGTTATCGC gCACTGTGTG C a AG AATAG A TATGCaTGCT 
TTTACGCGTA CTCGACATCT CGTTTCTCAC ATGAATGGAG AGGATGGATG TACCGCGGCG 
ATTACCCGAG CGACGCGCTT TCAAAAAAAG ATGCTCGCAG CGGTATTTTC ACTTGTGATT 
ACCCTTCTGG TGACCACTGT ATTTTTTTTC ATTGCAATTA GTTTGTTTGG ACAGGGAAAG 
CTCTTGTTTG ACAAACTTTC GCTCTTCTTC AGGGAATACG ATCTATTTGA AGGTGCAAAG 
CAACGGAGCT TTACCGCGCT TATTTTTAAA CTTTCCCGAG GAACGGTTGA TATCTCTACC 
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CTCAATGTGG AGGAGCATCT GCTACGGTTC TTCGGCAAGC ATGTAGAATC GGTGTTTGTG 
TATACACAAA TTTTTGTCAA AAACATCGCT CGCGCAgCCC TTTCCACGTT GTTCTTTAGT 
TTTACCCTAT ACTTTTTCTT TCTCGATGGG GAACATTTGT CCTGTCTGCT CATCGCTGCA 
CTACCCTTGA GGAAgCGCGC AAgCGcACaG TTGTTAGAAA AATGCAAAGA GGCAACGCGT 
CATTTGTTCA AAGGTCTATT CTCCATtGCT TTTTATCAGA CCTGCGTTGC ATTTGTGTTC 
TACGGAATCT TCCGCGTGGA AGGACCGATG GCTTTAGCAA TGCTCACCTT CTTCGCCTCA 
TTCTTACCAC TGGTcGgcTG CGCCTGsGTG TGGCTCCCAG TGGGAATTAG CATTGGATTT 
ACGAGCGGGT GGATGCGCGG CACCCTTTTC TTGTTTGTCG CTGGAAGTTC AATCACTATC 
ATCGACAGTT TCTTGCGCCC GTTGTTGCTG CAAAATAAAA TGCGCATCCA TCCATTGCTT 
ATTTTTTTCT CTATGCTCGG TGGGGTGCAG ACGTTCGGCT TTAACGGTAT GGTGCTCGGT 
CCTATTTTGG TTATCCTGCT GTTCACGGTT ATCGACTTGA CGCACGACGG GGAGTCTCAC 
TACACGTCTA TTTTCCACGA CCCCCCTGCT GCAGGTGTGC ACGCGCAGTC GATACACAGA 
CAAGGAAAAA AATAGGGATA TCTTGCTGCT CGGCGCCCTT TTTATTACCA TGCGGCCCAT 
GACGCGCGCG TGTATATTCG ATCTTGATGG AACGC TAACG AATACGCTGG GGACCATTGC 
CTACTTCGTC AATATGCAGG CTGCCCaTTA CCATTTACCC CCAATTCCCT CTGAAAAGTT 
TGCGCTGTTT TTAGGAGATG GTTCGCGCGC ACTGATTCAG CGCGTGCTtG CTCATTACGG 
CGCTGCAGCT CAGACTATTT CTGAGGATGA ATTTTTACAG CGCTACTGCC TCGCGTATGA 
GGCAGACTTT CTCCAACGCT GTACTGTATA TCCGGGGGTT CCTGAGATGC TTGTGGAGTT 
GAAACGACGC CGCATAGAAC TCGCCATTCT CTCCAACAAG CCACATTCTA TCGCGCAGAA 
GGTAGCGTCT GCTTTTTTTG GGGACAATGT TTTCTCAGTG GTGCTTGGCC AACGCGAAGG 
CGTACCCGTA AAACCAGATC CTGCTGGGCT TTTTGAGATC CTGCGTACCC TAAACGTGGA 
GACGGCGGAG GCGCTTTTCG TCGGAGACAC CGCCGTGGAT ATACGCACCG cGTcCGCAGC 
GCAAGTGCGC AgCGTGGGaG TGCTCTGGGG CTTTCGAGAC GAGACGGAGC TATCCCAGGC 
GCAAGCCCAC GTGCTTATCA GGACGCCCGC CGAGTTACTC CAGCACCTTT CTTTCTAGAC 
TCGCGGGTAC AAACTCAGAC GGAGCGCACG ACGCTCCCGG ATCCCTGCAg GGCACGAGCC 
GCTACTTCTC TTCACGCCCA ACGCAgTTCG CCCGCAGGGT ATAGCGAAGT CCACGCAGCA 
TCAGTGCCAG GGCGCCATCC CCAGTGATGT TACACGCAgT CCCAAAACTG TCTTGCAAAG 
CAAATATCGC AATGAGCAAA CCGGTTCCTG TGGTATCAAA GTGCAACACA TCAAGC AC C A 
GCCCGAGCGA CGCAAGCACC GTACCCCCTG GAACCCCCGG CGCACCTACG GCAAAAATGC 
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CGAACAAACA GGAGAACAGC AC CAT ATCTG CAAGAGAGGG CATGGACCCG TACAACATCT 
GCGCTATCGT TAGACAAAAA AAGGTCTCCG TCAGAACAGA CCCGCACAGA TGTGTGGTTG 
CACCCAGCGG GATCGCAAAA TCCACAATTT CTGCAGGCAG TGC CCGTG AC TTGTGCGCAC 
ATTGTAACGA AACCGGCAGT GTTGCTGCAC TCGACATCGT GCCCAGCGCA GTCGCATACG 
CCGCTCCATA ATGACGAAtA CCTCGAACGG ATTTTTGCGT GACAGTATCC ACCCCACCAG 
GTACAACACG CACAGCCACA GGAGATGACC CACAATGACG ACCGCTACCA CTTTGGCAAA 
AAGCGGC AG c TGACGAGTTA AACTCCCGCT GTACGCAAGT TCTGCGAAGG TAGCCGCCAC 
AAAAAAGGGA AGCAGCGGCA CCAACACTCG GCTAATAGCT TCACCCATCA TGCGACGAAA 
TTCATACAGC ACCTGCTCCA CCGCACGTGC TTTTACCCAG AGGGCAGACA GCCCCACCAA 
GAGAGCAAAA GCAAGTGCAG TGACCACGGG CATAAGAGAC GGAATCTCAA GGGTAAAGAT 
AACCTTAGGG ATTGTACGCA AACCCTCCAC CGTGCGCGGG ATCCGAAGAT ACGGGATAAC 
AACACGCCCC ATCGCGGTGG CAAAAAGGGA GGCACCCACC GAAGAGAGAT AGGAAAGTAC 
CAGAAACGAG CCTAGCATCC TACCGGCACT CGCTTTCAGA CTCAGGACAG TAGGGGCAAT 
AAAACCAAAA ATAACTAGGG GAATAACAAA AAAAACAACC CCGCCGATAA GCGTTTTCCC 
CGTGTGGATA ATGGCCATGA CCGACTCATT AACGCACAGC CCGAGCGCAA CGCCACAGAC 
CATCCCCCCA CTGAGCTTTG CGAGCAGCCA AAACCCCGCA CTCCCGGCCA TAGCGTCCTC 
CTCCGCACAC GCGCCGGcAG TATACCAAAA AGACTATCCT CTGATAACAG GTCAGCGGTC 
TTTTTATGTC ATAGAACCAA CCTCGAAGGC GAGGCAAAAC AGATCGAACC CGCACCTCCC 
AAGAACTATG CAGGAAAGAC GCACCGACGG GTTGCATGCC GGGGCGGAGc GCACCCCTAT 
GCAACAACCC AGCTTACTCC CACTACGGTT CACTCAAAGA ATGTTCTCGA ACTCCTCCCT 
CACCGACCGC GGCCATACAC TGGCCTGAAC TTCACCAATA TGCTTCCTTT GTAGGAGTAG 
CATCGCCAAA CGGGATTGAC CGATACCACC TCCGATGGAT TGAGGAAGAC GACCATTGAT 
CAGATCCTGG TGCCAGyTGC ATGCCAGACT ATCCTCATCG CCaGTAAGAG cCAGCTGCGT 
GCGAAGCGCA CCCTCGTCCA CCCGTATCCC CATCGAAGAC ACTTCAAACG CACGCCCCAA 
CACTGGATTC CACACCAAAA TATCGCCGTT CAGGCCCTTG TATTCCCTTC GGAAGGCGTC 
GTCCAGTCAT CGTAATCTGG AGCGCGCACA TCGTGCGGCT TGCCGTCAGA AAGCACACCA 
CCGATCCCAA TCAGGAACAC CGCACCATGC TCTTTGCAAA TAGCATCCTC ACGCCCCTTG 
CTGTCTAAAT GCGGATAACG CCGCACCAGC TCCTCGCTCT GTACAAATAC AATATCCGCA 
GGCAAAAACG CCCGTAGgCC GAACCTCTCA CTTACCAACA CCTCCGACTC CCGAAGAGCA 
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CCGTAGACCT T AC GC AC CG T GTCCTTCAGA TACGCAAGAT TTCTCGACCC TACCGGTACT 
ACCTTCTCCC AATCCCACTG ATCCACACAC ACAGAGCGCA CCTGATCCAA GAAATCTTCA 
TCCGGGCGGA GcGCGATCAT GTGTACAAAC AAACCCTCAT TATCCTGAAA GCCGTAGCGG 
GCAAGCGTGT GGCGnTTCCA CTTTGCTAAC GAGTGcACAA CCTCAAAGGC AGTACCCGGG 
ATCTGCTTcA CGGAGACGGA AACCGCCTTC TCCCGACCTG AAAGACCATC TTGGATCCCG 
TCACCCACCT GGCTCAGAAG AGGTCCCTGA ACTTCTATGA GTCCCAGGTG CTCCATCAGC 
TTTTGGGTAA ATGTGTGCTT GGCAAAGCTG ATCCCCTGCT GTTGCAAAAT AAATGATTTT 
TCCATAGTTA ACGCCAACCT TTTACCTTGT TGAGTAAACT GTGCACGCAT TATTTAATAG 
GGTGGCGGTG TAGTGCAATA CTCAAGTAAT CTGACAGCAG GGAGGTGGTG TGAAAAAACG 
AATGTGGCGC GCGGTGCGGA CCCTGCTTAT CATCTGTGCG GGGGGAACCG GAGCGCTGTG 
GGCGCATCCG CACGTTTTTA TCCGCACGAA AGTAACCTTT CAGTGGCAGA AGGGGGTGCT 
TCAACGCGCG CATATTACCT GGGAGTTTGA TCCGTTTTTC AGCGCCGATA TCATTAGCGG 
ATACGATACC AATAAAGACG GGCTGTTTGA CAAAAAAGAA ACACAGCAGG TGTTTGAAAA 
TGCCTTCATC CATACCAAAC ACTATTCTTT CTTTACC TTC ATCCGTTCCG GGGAGTCgCA 
TGCGCGACGT gCTCGCTCTC AAGCAGCACG TACAAGTCCC CAGTCAGTGC AGCATTTCTC 
GGTCAGTCAG AAAGACGGTA CGCTGTCTTA TCACTTCTCC ATTGACCTTT CTAGCTACCA 
GCACGCTAAG TCCGCACCCC C AGGAACC CG GCGAACACTG TATCTTGCAC TCTATG AC C A 
CTCATTTTTC TGCGAC TTTC GTTATGCAGA ACACGACACC GTACGCTTTG TGTGCGATAA 
GGCGCGCGTG CAGCCTTCCT ACGAAATTGT TGAAAACCGA ACCGCTCCTG TGTACTACGA 
CCCCTTCGAT AGCATAGAAA GCACTCCCCA ATACGAACAC TGGCGTCCCG GTCTGCATAC 
CTACTACCCA AAAGAGATTC TCCTGCGCTA CACTGCCCCC TAAGGTCCTT TTCCAAGGGG 
AGTTGAGAGC GTATGAAGAA AGTAGGGGTk cgCGTTCGCG CGTGTATCCT GTGCGCGCTT 
GCCGCGTGcG CCACAGGCGT CCTTGCTAAT CCTTTTTTTG GCGgcGCTCC CGCGCGCCCG 
CGgAGGCAGC GCACCCCGGA GCTTTTkcTG CGCAGATACG CGCTCGTCCA TCAACGCCTC 
GGTGCCGCCA TAGTACAGTG GAGCAAAACC CATTCAACAC GCGCGTGGTG GATTACTGTA 
ATGCTCTCCT TTGCGTATGG CGTTCTGCAC GCCTTAGGAC CAGGACACAG AAAGGCAGCG 
CTTTTTTCTT TCTACCTGGG GAGGAACGCA CCTGTGTGGG AACcTGCGCT CACTGCAGCG 
TTACTTGCGG CGTTGCATGG CGCAgcTTtC CCTGCTCTTG CTTTCTGCAT TTAGAGGTGT 
TTCCGGCGCA ATCGGTGCAC AC AGTGC AC G CACAATGTGG TACATGGAGG TGGGTTCCTA 
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CGGATTGCTC ACCTTCTTAG CGCTTTTCTC TCTCGTGCAT GAGCTGATGC ACCTTTTCCc 11700 

TTCGGGCGGG CGCTATTTCT CCTGCGGTTG CAGCGCGCAC ACTGCCGTGT GTATGCGGAC 11760 

AGGAACAGTC GCCCACATGC AGTGGGGTAC TATGCTCTTG AGCGGTTTAT TTATTTGCCC 11820 

TGCTGCGTTG TTTGTGATGA TTCTGGTGCT CAGCTTAGAT GC AG TTGG AC TTGGCGTCGC 11880 

AGCGGTGCTC AGTATTTCAG CGGGGTTAGC ACTCCCCCTG ATGGCTGTCG GTTATTTGGC 11940 

CTGGGCGAGC CGGGCAGGTA TTTTTTATCG CATGCAGAAG AACACTCGTC ATGCACAAGC 12000 

GGTGCTCTCT GTCGTGAGCA TTACC TCAT A CGGAATTATG CTCATCGTCT GTACTTCAGC 12060 

GCTCGTAGCT TCACTCGGTT GAAAGGAGAA TGTACCTCCG CTATCTAGGT GACACTGCCT 1212 0 

GGATAAAACC ATATACCTAA CACGTGGTGA ACGGAAGTAC GCAGTATCTT GCACACGTCG 12180 

GTGAGCTCAG CTTAAAGAAG GGGAACCGTA GACAGTTTGA AGTGCAGCTT GAGCGCAACC 12240 

TCACGCTCAT GCTACGAAGC ATAAAC CCTC ACGTTACTGT CCGCGCAGGC AGGCTGTATC 12300 

TGTCAGTCCC GGCCTCCTTT GAAGCACAGA CCACCGCTGA GCAAGCCCTC TCGTAC CTGC 12360 

TGGGAATTAC CGGTTGGGCT GCTGCTACGG CGTGCCCCAA AACTATGGAA GCGATCACAC 12420 

GGTGTGCACA TGCTGAGGCG ACGCTCgcTG CGCGCGAAGG AAAGCGAACA TTCAGAATAG 124 80 

AGGCgCGGCG CgcGGaACAA ACGCTTCTGC CGTACCTCGA GTGAGATTGC ACGGGAAGTC 12540 

GGCGCGGTTA TCCACCAATC AGGCGCTTTG TCCGTGGATC TCCATCATCC TGACGTGGTC 12 600 

ATTTTCATAG AAGTGCGCGA GCGCGAAgCC TTTCTGTATG GTGCCCGACG TCGCGGCCTG 12 660 

CGTGGTTTAC CCTGTGGCGT CTCAGGACGC GGGCTACTCC TGTTATCCGG CGGCATTGAC 12720 

TCCCCGGTAG CCGGGTACCG AATGCTTTCT CGTGGCATGC ACATTGACTG TCTGTATTTC 127 80 

CACTCTTATC CCTACACCCC TCCTGAAGCA CAGAAAAAGG TTGAAGACCT GGCAAAGGTA 12840 

TTGGCGCGCT ATGGACTTAG TACCACGCTG ACAGTCGTAT CGTTGACAGA CATTCAAAAA 12900 

CAGCTCCAAA CACACGCCCC TGCCCCTTCC CTCACACTGT TGCTTCGTAT GTGCATGATG 12960 

CGCATTGCAG AGCACGTAGC GCGGGAACAG CGCGCACGTT GCCTTATCAC TGGAGAAAGC 13020 

CTTGCACAGG TAGCAAGTCA GACGCTTGAG AAC t AACGGT GACCAGCGCG TGCACGCATC 13080 

TGCCGATATT CCGCCCGCTC ATTGGTGCAG ATAAAGAAGA TATTATCCGC ACCGCCACAG 1314 0 

AAATCGGTAC GTACGCCATT TCTATCCGTC CGTACGAGGA CTGCTGCACA CTCTTCGCAC 13200 

CAAAACACCC AGTGcTTCGC CCAGAGGTAG AAGAAATGCA AAAACAATAC CAATCTCTGA 13260 

TGCTCGGTCC ACTGTTAGAA GACGCGTTCC GGACGCGCAA ACGCACGCGC ATATACGGAA 13320 

ACTATGGGGT ACAGGAGTCA GGCGAATGAG TACCGCTTAT CTTACGCGGC AGCACCGTCC 13380 
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GCCCCTTCTT TAtGACGCGT TTATGCTAAC CAtCAAGgAT TATTcCACCA 
GGTAAATTCC AGCGCCTGTG CCACCTGcGT ACCAGGGCCT GAC TCTCCG A 
CACAAGGCAT TTTTCCCGCT TTGCCCACGC TCCCCAGCCT TGATACACCC 
CACTACAACG CGTGCTCCTC CCTGTATGCG CCGCTGCACC TCGTCCCcTG 
ACGCTCCTTG C AC AGT AC AG ACACCACAcG CACACGTCTT TTACTcAGTG 
AACGCCAAAT CCACCTCAGA GCCACTTGCC AAGACAGTCA GCTCAGGCGT 
TCGCGCACTA CATAGGCCCC CGACTCCTCC ACCGTAGAGC GCC ACGAACT 
TCAAAAACCG GCACGTTCTG CCGACTCAAA ACGATACACA CAGGACCAcT 
AACGCTATTT TCCAAGCTTC AAACGTTTCT TCTGCGTCAG CAGGGCGCAG 
TTGGGAATCG CACGCAGCGC AGCGAGCGTC TCCACCGGTT GGTGCGTCGG 
CCTACAAAAA TAGAGTCATG TGTTAAAACG AAAACAGAAG GGATGCGCAT 
AGACGGAGCG CAGGGCGAAA GTAGTCTGAA AAAACCATAA ACGTAGCGCC 
AAACCGCCGT GCAACTGCAT TCCGTTCACA ATGGCTGCCA TGGCAAACTC 
AAATAACAGT AGCCGCCTGC ACGATGCTCT GCAGAAAATG GTCTTAACGA 
ACCGCATTCG GCCCGCGTAA ATCTGCAGAG CCACCTACCA GATTCGGTAG 
AGCGCGTCGA GCACCTTTCC AGAAGCAGTC CGAGTAGCAA GTGACGAACC 
TGGGGACAGA CAACACGAGC TAGCTGCGAA GTACTTAnCC CTCCGGGAAC 
TCCCAGTCAG CACGTTTTTC AGGAT A t GCG TGCTCCATGC TTCAAAGAGC 
AGTCCTCGAC ATGCGCACAT TCACACTTTC GTTTCTGGAG AACAGCGGTA 
CTACAAAAAA AGAGCACGCA GGATCAAGTC CCAATGCCTT TTTTGCCTCT 
CTTCCCCAAG CGGGGCGCCG TGGGCACGCG CGCTCCCTTC AACGGTAGGC 
CAATAATCGA ACGCaGGATA ATGAGAGAAG GCCGATCGTC ACGCTTTGCA 
GATCCATAAT ATCCGTATAC GAATACATAG AACCGCGCAg CACCTGCCAG 
CGTAGCGCTT AGCCACATCC TCGnTAAAGT CAGATCGGTA G ATnCGTC TA 
GT 

(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16710 base pairs 

(B) TYPE: nucleic acid 
<C) STRAND EDNESS : double 
(D) TOPOLOGY: linear 



GCGCTTCGGC 
AACgGTCGAG 
CTGCCTCAGC 
CTGCCTCAAA 
CGCGGCACGC 
AGCACCCCCT 
GTCACTTTTC 
GCGGTGCAGC 
AACAAGCACG 
CCCATCTTCT 
GAGCGCCGCA 
AAACGCACGC 
GCGCACACCA 
AGAGACCGCT 
CACAGAGCAG 
CTTCTCAAAA 
AAAAGCAGCG 
TCATTCCACG 
AGCTCAGGCG 
CTCACCCCCG 
GCACCCTTTC 
CACGCAgTGA 
CCATACGCTT 
TGCTGATGTG 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 



TGCATCnGAG ATACACAAAC nGTTTCTGCC CCTTAAAGCT TCAACTGCAA gTACTTTTAG 
CTGAATGACG TTGGCGCATT CTCGCGCGAG TGTTTCTTCC GTGCGCGGCG TGATAGCTTC 
CGGTCTGAGG CCAAAGACTA CCTGTGTGCC AATATAG TTT TTTAACAAAA AGGACGCGGG 
GATATCCGGT CGAAGAACAA AAAGACCTGC ATCAATTTTT ATCCCATGCT CATCTTTAAC 
AATCGTAACA GGGAAACAAT TCATAGGAGG AGAACCAATG AATTGTGCAA CGAACGTGTT 
CGCAGGATGC TGGTAGATAT GGAGAGGAGA ACCAATCTGT TGTACGCACC CGTCTTTCAT 
GATGACAATC TTATCTGCCA TTGTCATCGC TTCCATTTGA TCATGGGTGA CGTAAATCAT 
CGTGGCCTTT AGGCGCTTGT GAAGAAGAGA GATTTCGGAT CGCATTTGCA CGCGCAACTT 
TGCATCTAAA TTTGACAATG GCTCATCAAA GAGAAAAACC TTAGGATTTC TGACAATGGC 
ACGTCCAACT GCAACGCGTT GTCGCTGTCC CCCCGAAAGT GCTTTGGGTT TTCGGGCAAG 
CAGTGGTTCG ATATCAAGAA CACGCGCTGC TTCGTGGACA CGGCGGATGA TTTCTTGCTG 
AGGGATTTTA CGGATTCTAA GGCCGAACGC CATGTTGTCA AAAACGTTCA TGTGTGGGTA 
GAGCGCGTAg TTTTGAAAGA CCATCGCGAT ATTGCGATCT TTTGGTGTAA CGTGATTCAT 
GTGCTCACCG TCAATGTAGA GGTCAC CTGA GCAGATATCT TCAAGCCCTG C AATG AT AC G 
TAtGCAGTTG ACTTGCCGCA TCCAGATGGT CCGATGAACA CCACGAACTC TCCACTTTCT 
GCGGTAATAG TTACGTCTTT TACTGCATGG ACGCATCCGT GATACGTCTT ACAGATATGC 
TTGAGTTCAA CCTTTGCCAT AGCGTTTACA TTCCTTTTGA AACACGGGTG CGCAACACAC 
TACTTTCCTT ACGCAAACGG GAGGTGGTGT TGTCATGTTA CGCGCTCTGC ATGTGGTGCA 
AGCGGTCGTC TAGATATGCG ACAAGCGCTT CGGTAAACAA ATCTTGATTT ATCAATTCGT 
GTGCAAAGAA ACGGAAGCCG ATGGCACCGG CGTGTCCACC GCCGTTTTGA ATAGCAAAAT 
GAGATAGCAG TAAACGCAAA TCCAACTGTT TCACCCGCGC GGCAGTGCGC ACGCGCCACT 
GCGTGACATA AAATTGCTGT GTTGGATCTG GATACGCGGA GATACCGATA CCTGAGATAC 
TCTCTGCAAT GTTGCTGGTC GCCATTTTTA TCAACGAAAC GAAATGTTCT GCATTGCAAC 
ACTCGTGCAA CGCCTGTGTT TGTGTCTGAT TGAGAAGGAG CAGGGAAATA CTGCCGCGGT 
GTTGCACGTT TTTCACCATT GTTTGGTAAA CCGCATGTTC TTCAGCGGAC AGAGACTCCA 
GGGTATGGAG GATTTGTTTG GTGGATGCTA TGTTCCCGGT AATCGGTTTT GTTTTTTCCC 
TGAGCATAGT GTTCAGCCTC TGGGTGAAGT AGGTGTAGAG CGCGCGGTCT TTCCGACTTA 
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TCAAATAAGC ACCTGTCTTT GCATCCCCTA TC AT AC C GG T 
TGCGTGAATA CAAATTGCGT ATCCCAAGCG CAGTgcGCTG 
TGTAACAAAG ATAAGCGATG ATTTCGC AC G TGCTAGAGGC 
CTGGATCACC GCAGCAGGCA GCGTTTGCAA AAAGGTGATG 
TAGTCGAATC TGAAAGGAGT AGTCGGCACG ACGGAGGCGC 
GGGTGTCTAG AATGACCAGG GCATCCGGCA TACGGGGGAC 
CGGCAATTCC GTTATAGAGA CAAATATCAA TGAGGAATGA 
CTTGACAACA AATTTCCACG CGTTTATTGC ATCGCGTGAG 
ACGATGCGAT ACAGTCTTCA TCAGGATGTT CATGTCCGAG 
TCGCAATCTC CTGCAGAATA TTTCGGACGA CGGCATTTTT 
GTTTTGGCGA CGGAGAATAG GTGTAGTCAC cACTAGCGCG 
ATAGAGTGTC CGCTGGTGTG CGGCAAATGA GACCGTTCTC 
GTTGCTCCCT TGTTACTCGT GTGAATTTGC AGTACCGTTG 
GCGCAGGAGT GATTGTGTGT GCGCCAGACG TGACGGAGCC 
TTTTGCATGA TGACGTTTCC CGAGGCTGCA TCGTTTTTGC 
GCGTGCC TGC GCGTCTTGCG TATATGAACG GACTGCAGCC 
CTTGCGATTG CGTTGGGGCA TGCTGCAAGA AGCGTACACA 
GTATGGAGAG GAAATGAAAA TTATCATCAG TGCTTCTGTG 
CTTTGATTTA GCGCGTAAGc GGCGTCACGA GTACATCACC 
TGCCCTTGCG CATCCTGCTG CGTTGGAAAT TATCAATCTC 
CATCCATAGT AATCTGTCAG AGTTTTTAAA AACACAAGTT 
TCCTTCTCAA TCACTGGGTT TTCAGCATCT GCTTAAGCGT 
GAATAAAAAA AGTGCTCTTG AAGTCGGAGA TCTGTTGGTA 
AAACTATGCT TCGTACTACA TGCGTATGTC GGGTATGAGT 
AATAGCTCGT GTCAATGGCA TCCGACACGG GGATAAGAAT 
GCAAGAAAGG TATTCAGAGT CAGACGACGT TGGCGAATCT 
CGATGGAACA GAGGGGGATG GTAACACAGC GGACGTACAT 
GCATAAACGC ACGGATGCAG ATACGCATCG GTATACGGTG 
TCTCACCGAA CGTGCTCGTC GGGGAGAGCT TGCTCCGCTC 
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GAGTATTGAA AGCACAACAT 1680 

CGCGTgGTTA CGCGCGAGTT 1740 

GCGTGCGATA AGGCTGTAGC 1800 

GTCAAGTTCA ATTTTACGGA 1860 

GTAGATCATA CCGGGATTTG 1920 

AGTTTGTGTA TCGAGATGCA 1980 

AATCTGCACA CGAATGGGAC 2040 

CAGGAGTGCG AAGGC TACTA 2100 

CAAAAGAAAG GAGCCGTGTA 2160 

CGCTGCAATG GAGAGATCTC 2220 

CCCATGACGG CGAGTATACT 2280 

CCAATTATTT TCTGGGTAAT 2340 

GGCCAGAGGC AGCTTGGCGT 2400 

GAGGTTGTAT TGAGTTTTCC 2460 

CGTTCGTGCG GCGCGCAAAG 2520 

CTGCCTTGTG CAGGGTGTGT 2580 

GGGGGGCATC CCCCACGAGG 2 640 

CAAATTATTC TTGATCAGGC 2700 

GCAGAGCATG TTCTTTTTTC 2760 

TGTAGCGCGG ATATCGCGCT 2 820 

CCCGTTGACT TAAGTCATAC 2 880 

GCAGTCTTGT ATTGCGAGGC 2940 

AGTCTCCTCC AAGCGGAGAC 3 000 

ACGGCGCGCT TGATTGAAGT 3060 

GTGTCGATGG GGTCGAACGC 3120 

GCAGGGCACG GTCCTCCGCT 3180 

GTGCACTATG AACACTGCGC 3240 

CTGGAAAAGT ATACGGTGAA 3300 

ATTGGGCGTA CGCAGGAAAT 3360 
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TGAGCGGACG ATTCAGATTT TGTGCCGGAG ACAGAAGAAT AATCCGGTAC ATGTGGGTGA 3420 

AGCTGGTGTG GGAAAGACGG CAATTACTGA GGGGCTTGCG CAACGTATCG TGCGGTGCGA 34 80 

TGTGCCAGAG GCGTTAGAGG GAGTAGAGAT TTTTAGCCTT GATATGACAA GCCTGTTAGC 3540 

AGGTACAAAG TTTCGAGGGG ATTTTGAAGA GCGGCTCAAG CGTCTTGCAG AAGAGTTGGA 3 600 

AAAGAAAACA CAAGCAATTC TTTTTATTGA TGAAATTCAT ACGGTAGTCG GTACTGGCTC 3660 

AGGCGGTTCG GGTGGTTTGG ATGCGTCTAA CTTACTCAAA CCGCTGCTTT CTTCAGGAAA 3720 

GATTCGCTGT ATTGGTTCTA CCACGTATGA GGAATACACC AAACATTTTC GCAAAGATCA 3780 

GGCGTTAgcA CGGcGTTTTC AAAAAATTGA TATTGAAGAG CCTTCTGAGG AGGAAACCCT 3840 

CCGAATTTTG GAAGGGATTC GCACGCTTTA CGAAGACTTT CATGCAGTGC ATTACAGTGA 3900 

TGAAGCATTA GCTGCTGCGG TGAGACTTTC GGTGCAATAC ATCCAAGGGA GACATCTGCC 39 60 

GGATAAGGCG ATTGATATTA TCGACGAAGC AGGCGCGTGT GCAAAGCTAT CCCGGGGAAA 4020 

GCACGGAACA GAGGGAGTGT GTTCAGTAAT TGGGGAGTCG GATATAGACG AAATTGTGGC 4080 

AAAAATTGCG AAAATCCCTA AGCAGCGGGT ATCTGCAAGT GAAATAGAAA AGTTGCGTAA 4140 

CTTTGAGCGC AGTATTTCAG AAAAAATTTT TGGACAAGGC GAGGCAATTG ACTTAGTCAC 4200 

TCGTACgCTG AAGCGCGCGC GGGTGGGATT GCGCGTAAAG CATAAACCTA TAGCAAACTT 42 60 

GCTTTTTGTG GGGGCTACCG GTGTGGGAAA AACAGAGCTT GCGCGGACGC TTGCCCAGGA 4320 

ACTAGGGATT GTGCTGCATC GTTTTGACAT GAGTGAGTAT CAGGAAAAGC ACACGGTGAG 4380 

TCGGTTGATC GGCTCACCGC CCGGTTATGT TGGGTTTGAA GAGGGGGGAT TGCTCACCGA 4440 

CGCGGTAAGG AAACAACCGC ATGCGGTGCT CCTTTTGGAC GAAATAGAAA AAGCTCACCC 4500 

GGACATTTTT AATGTCCTGC TCCAGGTTAT GGATTACGCA ACGCTCACTG ACAACCAAGG 4560 

CAGAAAAGCG G ATT TTCGC A ATGTTATTTT GATAATGACA AGTAATGCGG GTGCCCGGAA 4620 

CATGGGTGTT TCTCTCATCG GTTTTCACAA GGGGCAGGTG GGTACTGCAG TTATCGACGA 4680 

AGCAGTAGAA CGTATTTTCT CTCCAGAATT TCGGAATCGG CTGGACGCAg TTATTCGTTT 4740 

TGATGCGTTG TCCTTGGAAA CGATGGAACG CATCGCCCGC AAGGAGCTTG CCCTGGTGTG 4800 

TGAGCAACTG GAGAAAAAAC ACATTCGTTT TGATATTACC GATGATGCAC TCGCGTTGCT 48 60 

CGCTGAGCGT AGTCACTCAG GGGGAAGTGG TGCGCGTAAT G TTGC ACGCT TGGTAGAGCA 4920 

AGAAGTTGCA AATGTGCTTG CAGATCTTAT GCTTTTTGGA GGAGTCGCTG AGGGGGATGC 4980 

GTTGCGGTGC ACGGTAAAAG ATCGGCATGC TCAATGCAAT TTTCTCCGCA TCGAGTGCGT 5040 

GCAGTCTTCG TATTCGGGGA GTATCCAAGA CGCGCTGGGG TGATGATGCG TGGCACGGTA 5100 
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ACGGTGTATC CGTGTGGTAG GCAGTGCACG TGCACTGAGT TATTCAAACG CGTTTCTCTC 5160 

TTATCTTCCG CCAAAGCTCT ACACTCCATA CCGCCACCTC GTCTATGATC CTGCGTGCGT 5220 

TCCCACCGGC AGTGAGAGTG GTGCCTGAAA CGAGCGGAAC gCaGcGAGCA CCATTCCATT 5280 

TGATTCGTCA AGAACATGCG CAAAACCGAT AACGTCCGTT TCTTGTAGCG GTGCGCGCAA 5340 

AAGGGGAGGG AGTGCGAACG TTATGCTGAT GCGCGTACCG GGCGTATTCA GAACTGGGCA 5400 

CGATGTACAC GAGGGGTGGA GGATTGGTCG CAGTGCCCCA GGGCGCTTAC TGCCCAGGAC 5460 

GGGAATAGCA CAGGGAAGCG CATCACTGAT AGCCGTGCGT ACATCGGTGC ACTGGAAATG 5520 

GGTGAACGCC CAGTATAGAA GGGTAGCGCC GTCACGCGCG CGGATGCgCT TTCCTTCGGG 5580 

AATGGAATTC CCTGCGCCGC CTAAGATAAC TGCAATGATG CGGGTGTCTC CGCGCAGACT 5640 

GCTGAGCGCG ACGTTAAAAC CGGATTCGCG GATATAGCCA GTTTTTAATC CGTCGCAGCC 5700 

GGGCACTGCT GTGGAAACGT TTCCGCGCGC TGCGGTTGCT GCAAGCAGGG TGTTGGTTGC 5760 

AGGGAAACGT GTGTGTGGGG GTATTCTCGA AGTAGCGGGA GAGTTACTAC GCTGAAAGTA 5820 

gCTGCGCGCA TGAAAGCGTG CAAGGTTTTC AGGCCATCGG CGCACATACT GGCAACAAAA 5880 

GAGCACAAAG TCACGCGCAg TAG TTACATT GTGTTCGCTC AGACCGCTTG GTTCCACAAA 5940 

GCGTGTGCGC GTAAGACCCC ATTTCTGTAC . AAGCGTGTTC ATGCGCGTGC AGAACGCCGG 6000 

TATACTGCCT GCAACTGCAT AAGCGAGGGT GTAGGCTGCA TCGTTCCCCG AAGCGATGTT 6060 

CATACCTGCG AGTAAGTCGT GTACGCTGAT GTACTCACCT GTACGTAAAA ATATAAGCGA 612 0 

GCTGCCCGGT GCAAAGGCCT GCGCGCTACC TGCAAGCGGT ACGCGTATAC GCTGTTGCCA 6180 

GTGGAGTTCT CCACGCTCGA GTGCTTCCAT GACCACTGCA CAGGTAACCA GTTTTGCCAA 6240 

GGACGCCGGG GGGAGGGGTA GGTCTGCGCA GAAGGAGGCA AGGAGTGTGC CGCTTCCTCC 6300 

TTCGGCGATG GCGTACGCGC GGGCACTGAT GGGGGGTGGG gTGGATCCTG CAGATAGATT 6360 

GGTAAATTGA AGCG TACGGA TAGGGTGAGC AGAAAAGGGA TTACGGGACG ACGGAGAAAA 6420 

ACGCAkTgTT GGGGAGAAAA GAAAGGAAGG GTGGAGTACt CkCTGCGCGT GCCACTGCAG 6480 

CGCCCCTGCC CCGACTACGC ACGCACCTAA CGCATACAGC GCGCGCTGCC CACGCCGCCG 6540 

TGCCACGGCG CGCAGGGAGC GCGATGAGTG GTCTTTC AAA CGGGCAGTAC ACGCGTGTAC 6600 

CGGGTGTCCC GGCACACCGT ATTAGGGACA GAGCG TACGG CACGGGCCGA CCGCACATGC 6660 

GTCACCCCTT GAAAAGCACG CCGCGACACG TCCTGTGTAT CACAGAAAAC ACCACACGGG 6720 

TCATCATACA GCCTCCCGGC AGAGCATGTT GTTTGCATTA CTTTAGTATA GCAGAATGCG 6780 

AAGTGTGCAG CGAAGGATTC ATCAATCCTG TTGCGTTCTC TTCTTTTTTG TGAGGCATAT 6840 
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ATTGCACCGG 


ATGCTCCTTG 


CGTCTGCCTG 


CCCTTTTGGG 


CAACACTTGT 


TGCCAGAAAC 


6900 


CTGTCTTCAC 


GAAgTCTTGT TTAAATCAGA 


GTAGGAAACG 


CTATGACGCG 


AAAATTAATC 


6960 


ACCGCCGCAC 


TCCCCTATGT 


GAACAACGTT 


C C AC ATTTGG 


GAAATCTTAT 


CCAGGGTCTT 


.7020 


TCTGCAGACG 


TTTTCGCACG 


TTTCTGTCGG 


ATGCGCGGCT 


ATCACACGTG 


TTTTGTATGT 


7080 


GGTACCGACG 


AATACGGCAC 


GGCAAGCGAA 


ACCCGTGCGG 


CAGAACAAGG 


TCTCAGTCCT 


7140 


GCACAATTGT 


GTGCGCACTA 


CCATGCACTG 


CATCGCGACA 


TCTATCAGTG 


GTTTGATCTG 


7200 


TCCTTCGATT 


ATTTTGGGCG 


CACTACAAGC 


GATGCGCATA 


CTGAGcTTAC 


GCAAGCGTTG 


7260 


TTTCGTCATT 


TGGATGCGCG 


GGGTTTTATC 


AGTGAACATG 


AAAGTGCGCA 


GgCGTACTGT 


7320 


CTGCACTGTG 


CACGGTTTCT TGCTGATCGC TATTTGCGCG 


GTACCTGTCC 


CCATTGCCGT 


7380 


AATGCTGAGG 


CGCGTGCTGA 


CCAGTGCGAG 


CACTGTGGAG 


TGCTCCTTGA 


GCCGGAAACG 


7440 


CTCCTGAATG 


CGCGCTGTGT 


GAGCTGTGGC 


ACGGCGCCGG 


AGTTTCGCCC 


TACGCGTCAT 


7500 


TTGTATTTAA 


ATTTGGCTGC 


ACTGGAAAAA 




CGTGGTTTTG 


C AC C ACGAAT 


7560 


CATCTGTGGA 


CTAAAAACgC 


GGTGC gTATG 


AC IvjAAWj 1 1 


GGCTACGTAC 


GGGATTGCAG 


7620 


GAGCGTGCGA 


TCACGCGCGA 


TCTGCGCTGG 




TTCCCAAAGC 


AGGATTTGAG 


7680 


CAGAAGGTAT 


TTTATGTGTG 






ACATTTCCAT 


TACTAAGTGC 


7740 


GGCACAGAGG 


CAGCTTCCTC 




flf^ttfiGGACCG 


ACGATGGCGT 


GAAAGAAAAA 


7800 


TGGCAGTCTT 


GGTGGCTTGA 


TCAGCAGGAT 


GTGGAGTTGG 


TCCAGTTTGT 


GGGGAAGGAC 


7860 


AATATTCCCT 


TTCATACGCT 


GTTTTTCCCC 


TGCATGCTCA 


TCGGTTCGGG 


GCAGCGGTGG 


7920 


ACGaTGcTTA 


CGCGTCTTTC 


TGCGACGGAg 


TATTTGAATT 


ACGAAGGGGG 


aAGTTTTCTA 


7980 


AGTCTTTAGG 


GGTGGGCGTT 


TTTGGTTCGG 


ATGCAAAAGA 


ATCGGGCATT 


CCCTCAGATC 


8040 


TGTGGCGTTT 


TTATCTCCTG 


TACCATAGAC 


CGGAAAAAAG 


CGATGCGCAC 


TTTACCTGGC 


8100 


ATGAGTTTCA 


GGAGCGTGTA 


AACAGTGAGT 


TGATTGGTAA 


TCTGTGTAAT 


CTGGTCAATC 


8160 


GTACGCTCAC 


CTTTGTGGCG 


CGTACGTACG 


GGGGCGTGGT 


CCCTGCGCAA 


GATGGAGCGC 


8220 


GCAgCACCCG TGCGCAGGTG 


ATGGAAGAAA 


CGCTTGCGCT 


CCGCGAAGrt 


GCGGgAATAC 


8280 


TGCAAAGCGC 


ATGACAGATT 


TAATGGAGCA 


GGTACAGTTG 


CGAGAAGCGT 


TTAGAGAAGT 


. 8340 


GTTTGCGCTC 


TCAGCGCGTG 


CGAATAAGGC 


GTTGCAGGAT 


GGTGCACCGT 


GGAAAACGCG 


8400 


GGCGCAGGAC 


CGTGAACGTG 


CAGACGCCTT 


GATGCGTGAG 


TTATGCTATG 


TGATTCGGGA 


8460 


TGTGCTGATT 


TTAGCGCATC 


CTTTTTTGCC 


GTGGTACACG 


CAGCAAGCGG 


CCCGATTTTT 


8520 


. GGGTGTTCAG 


TTGTCTTCCT 


GTGCACCAGA 


GGGGGGAGGA 


GCTGTGTGTG 


CTGCGAAGAA 


8580 
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AGACGCGGAT ACGGCGCAng AACnACAGTG CAACCGACCC TCCGATGGTC AGACGTGGGA 8640 

GAACGCAAGG GTTTAACGCA gGTGCATCCG CCGGTGATTT TATTCCGTCC GTTGGAGACG 8700 

GAAACTATTG CTGCGTATCG TGCCCGCTAT GCTGGAACAc CAGGGATGGG GCAGGAGTGA 87 60 

GCGTACCGCG CACTGCACAG ATGCCCACGG GAATGAATAA GAAAGAGACA GACGCTCAAC 8820 

AAAAGAAGGA GGAGCGTGAA ATGCCCCCTC CCTCAGATAC TGCACGGTTA TCTGCATTTT 8880 

TTTCTGAGCG CGTTGTACTG AAAGTAGCAC GAGTGTTGCA GGTGGAGCGT C ATC CGAATG 8940 

CGGATATGCT TTTTGTTGAA ACATTAGATG ATGGCTCTGG CGTTGAGCGC GTTATTGTTT 9000 

CTGGTCTTGT GCCTTATATG GCTGCAGATG CGTTGCGTGG TGCGCACGTG CTTATTGTGG 9060 

ATAATCTGCA GCCGCGCTaC TGCGTGGGGT ACGGTCTTGC GGCATGCTGT TGGCCGC AG A 9120 

GTATGTAGAT GCGCAGGgCA CAAAGGCAAT TGAATTGGTG CAGGCGCCAT GGGCTCTGCC 9180 

CGGTGAACGC GCAACACTTG CGAGTGCGCC GCCGGTCATT ACACCGCACG GGTCTGCCGT 9240 

TATCGATGCG GACGCTTTTT TTTCTGTGCC TATTCGTGTG GTAAATTATG CAGTAGAAGT 9300 

TGCAGGTGAG CCGCTCATGG TTGGAGGAAG GCCACTGGTA ATGCAGCGAG TGAAAGAGGG 9360 

AACTGTCGGC TAGGAATATT CACAGAGCAT TTGGTTTTCC GTGTCGGATA GGGGGAGCGC 9420 

Ag c ATGAACG TGGGATTTTT GGGTTTTGGA GCAATGGGAC GGGCGCTGGC AGAAGGGTTG 9480 

GTGCACGCAG GAGCGCTGCA AGCGGCTCAA GTGTACGCCT GTGCGTTAAA TCAGGAAAAG 9540 

TTGCGTGCGC AGTGTACATC TTTGGGCATA GGTGC CTGCG CGTC AGTTC A GGAACTGGTA 9 600 

CAGAAAAGTG AATGGATTTT TCTTGCAGTC AAACCATCTC AAATCAGCAC GGTACTGCGC 9660 

GATCGCCAAT CCTTTCAGGG AAAAGTGCTT ATTTCCCTTG CGGCGGGTAT GTCTTGCGC T 9720 

GCATACGAGG CATTGTTTGC CGCGGACCCT CATCAGGGTA TCCGTCACCT GTCACTTTTG 97 80 

CCGAACTTAC CTTGTCAGGT GGCGCGGGGG GTGATCATTG CAGAAGCGCG CCACACCCTG 9840 

CACCACGATG AgCACGCTGC GCTTTTAGCA GTGCTGCGCA CAGTTGCACA GGTAGAGGTG 9900 

GTGGACACCG CGTACTTTGC GATCGCAGGG GTGATTGCAG GCTGTGCTCC GGCGTTTGCC 9960 

GCGCAGTTTA TAGAAGCGCT CGCTGACGCA GGGtGCGCTA TGGCCTGGCG CGCGATCAAG 10020 

CGTACCGGCT TGCGGCACAC ATGCTTGAAG GGAC TGCAGC GCTCATACAG CACAGTGGTG 10080 

TACATCCTGC AC AAC TT AAA GATCGCGTGT GCTCTCCTGC AGGGAGTACT ATTCGCGGGG 10140 

TGCTTGCGTT AGAGGAGCAG GGATTGCGCC GTGCAGTTAT ACACGCGGTG CgCGCTGCGC 10200 

TCAGTTCTTC CTAAGGGGTG GGCAGGGTGC ATTGCTTGTT TTTTTTGACT GCTGACAGTA 10260 

CAGTTGCACC CTTGTGAAAA GTTCGTGCGT ATATTGGCGG ATCGGGGTTC TCGTTTGTAT 10320 
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TCTGTGTGGA GTGGGGAGCT GTGGCGgTCG TGCGCGCGTG CgcGAGTATT CGCGTGCGGA 10380 

g c TTGTTATC GGTACGCTCT GTCGCGTGCG CGTGTACTCT AAGCGACCTG CTGCTGAAGT 10440 

GCACGCGGCG CTTGAGGAGG TGTTCACGCT GCTACAACAA CAGGAGATGG TGCTGAGTGC 10500 

TAACCGTGAT GACTCTGCGC TTGCTGCCCT AAACGCTCAG GCAGGTTCGG CACCGGTTGT 10560 

TGTTGACAGG TCGCTGTATG CGTTGCTTGA GCGTGCGCTT TTTTTTGCAG AAAAGAGTGG 10620 

GGGTGCGTTT AACCCCGCAC TAGGTGCGgT AGTCAAGCTT TGGAATATTG GCTTTGACCG 10680 

TGCTGCTGTC CCTGACCCCG ACGCGCTCAA GGAGGCGCTG ACACGTTGTG ATTTTCGTCA 10740 

GGTGCACCTG CGCGCTGGGG TATCGGTGGG CGCGCCACAC ACGGTACAGT TGGCACAAGC 10800 

GGGCATGCAG TTGGATTTGG GCGCCATTGC TAAAGGATTC CTTGCGGACA AGATTGTACA 10860 

ACTGCTCACT GCGCATGCTT TGGATTCAGC GCTCGTTGAT CTGGGAGGAA ATATTTTTGC 10920 

CCTTGGTCTT. AAGTATGGAG ATGTGCGCTC AGCAGC cGCG CAGCGGTTGG AATGGAACGT 10980 

GGGTATTCGC GATCCGCACG GCACGGGGCA GAAGCCTGCA CTGGTGGTGT CGGTGCGCGA 11040 

TTGCTCGGTG GTGACTTCTG GTGCGTACGA GCGTTTCTTT GAGCGTGACG GGGTACGCTA 11100 

CCATCATATC ATCGATCCGG TTACCGGGTT TCCGGCACAC ACTGATGTGG ATTCTGTGTC 11160 

TATCTTTGCA CCCCGTTCCA CAGATGCAGA TGCGCTTGCT ACCGCCTGTT TTGTATTGGG 11220 

GTATGAGAAA AGCTGTGCGC TCTTGCGTGA ATTTCCCGGT GTTGACGCGC TGTTTATTTT 11280 

TCCTGAcaaG cgcGTGCGCG CAAGTGCaGG GATTGTCGAT CGCGTGCGTG TGCTCGATGC 11340 

ACGTTTCGTG TTAGAGCGTT AGGACAGCAC GTGTGCTGTT CGTGTGTAAA AAAGTGTGGC 11400 

GGACTGTCCT CATCATGGTG TGTGTGCAGG ATGCGTGCGC GGGGGTTCGG TCAGATGTCA 11460 

GGGTGTAGGC AAAGATGAGC GCAGCGCTGA CAAGAGGTGT TGAGTGCACC CTTTACTCCT 11520 

AGGTTCAGTG AGCTGCGTAA TTTTGAATCG AGGAGTACAG TGATGGAGAC GTTTTTTACC 11580 

TCAGAGTCTG TGAGTGAGGG TCATCCTGAT AAGCTGTGCG AC C AGATTTC TGACGCTGTT 11640 

CTTGATGCCT GTCTTTCGCA AGATCCTCAC AGTTGTGTTG CGTGCGAAAC TTTTGCCTCC 11700 

ACGTCCCTTA TCCTGATTGG AGGTGAAATT AGCACGCGGG CGCATATTAA TCTTACCCAA 117 60 

ATTGCGCGTG ATGTTGCCGC TGACATTGGA TATGTAAGCG CTGATGTCGG TCTTGATGCA 11820 

GCGTCCATGG CTGTTCTTGA TATGACTCAT CATCAGTCGC CTGATATTGC GCAGGGGGTG 11880 

CACGGTGCAG GACTGAAGGA GTTTGCAGGA TCGCAGGGGG CAGGGGATCA GGGGATTATG 11940 

TTTGGTTTTG CGTGCCGCGA GACGCCGGAG TTTATGCCCG CCCCCCTCAT GTGCGCGCAC 12000 

GCGgTTGTGC GCTATGCTGC CACGCTTCGT CATGAACGCC GTGTGCCGTG GCTGCGTCCT 12060 




13041 
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GATGCAAAAA GTCAGGTTAC CGTACAATAC 
GTTGTGTTTT CTCAGCAGCA TGATCCGTCA 
ATAGAGGAGA TAGTGCGTCC GGGGCTTGCA 
TTTTTTATCA ATCCAACCGG TCGTTTTGTC 
ACCGGGAGAA AGATCATCGT AGACACGTAT 
TTTTCAGGTA AGGATGCATC TAAGGTAGAT 
GCAAAAAATA TTGTGGCAGC CGACCTTGCT 
ATCGGCGTAC CATATCCGGT TTCGCTGCGG 
GAG TC AC AC A TCACACACGC GGTGAAAGAG 
CGCACGTTGG ACCTGTGTGC GCCTCGGTAC 
CGCGAACAGT TTCCTTGGGA ACGCACAGAc 
CCGTTCGCGC TC TCTGGCC A GATAAAAGAG 
CCTGTATCGT TACAGCCCTT CACTTTCTGC 
TATGGAAAAC CCAAGGGTAT GGACCTGCTG 
GGGGTCATCG TAGTGCGTGT GCAAAAAGTG 
CGCGCGTAGG CgCAGACCAC GGCGTACTGT 
CCGGACCTCT TTGAAGAACA ATTC GCGAT A 
AACAATGTGT GCGATGGATT CGACCGTCTC 
AGCTCCCTCC TGGTGCACGG GAGAGGAAGT 
TGTTC CTGTG CGTGATGCAC GGGAACTGCA 
AACCCTTTCC ATTCCCGCTG TTTCATCTTT 
AATTTTCTCC GAAGGAAGGG CAGGTAACTC 
GCCCGCCGTC TTGCGCGCAT GCGGAGTAGA 
ACCCTTTTGA TGATGaTgCT CCGCACGTTC 
TCGTTGCAAA TACAGATCAC CACTGATGCT 
CACCAAGTGC AAAATATCTT CGTCTTTCCA 
AAAGATATTG TAGTGATGTG TTTTATACTT 
GGCTCGATCA TCGTATCTGT GCCAGTATTT 
TTTTGTAATA GTTTTCGCGA TTTCAATGGT 
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GAGGGACATC GACCGGTACG TATCAGTGCG 12120 

CCTTCATACG AAAC C ATT AG AGAAACGCTC 12180 

CCTACAGGTC TGTTAGATGA AAACACGCGT 12240 

ATTGGCGGTC CCTTTGGGGA CACTGGTTTG 123 00 

GGGGGAATGG GCCGCCATGG AGGAGGCTCC 12360 

CGTTCTGCAG CGTATATGGC GCGTTATATT 12420 

GAGCGCTGTG AGGTGCAGCT TGCATACGCA 12480 

ATAGAAACAT TTGGAACGGC GCGCGCATCT 12540 

ATTTTTGATT TAACCCCAGC GGGTATCGTG 12600 

CGCTCGACTG CAGTGTATGG TCACTTTGGG 12 660 

TGCGTGTGCG ACTTACAGCG TGCGGTGCGC 1272 0 

TAGCTTCGTT TCTTTTTTGT CTGCGCGGGG 12780 

CCATGTTACG ATGATTGGCT CTAGGGAATG 12840 

GTATTCATGA CTGTTGGGCC ACCGTTGGTA 12900 

ATAGATGGTG TCTTCTGCAT TGTTTTTGcG 12960 

TGcACGGTTG AGCACCGTAC GAATGGCGGT 13020 

CACGCCGTAG TCCTTCCCGG TGATTCTTTT 13080 

TTCCGGTGGG GAAGATCGCT CTACGGTTGC 1314 0 

GACGATATCG GAATGTCTTG CCTCCGCGTG 13200 

CGTGCGGACG TTCTTTGTGT TGTGACGGAA 132 60 

CTGTTTTGAT TCCCCACTAT CTTTTTTACT 13320 

TTTC TTACGC GCACGAGTCC gTGCACGCGT 13380 

AGATCGTCTG TGAGTCGCAG GCACTTTTTT 13440 

TACCAGGCGC GCTTTTAACC AGTGCCAGTT 13500 

GTAGTAATAC CCTGAGGCGA CGTGCGCGAG 13560 

TTGGTGACTC CGTCCAGTTT TAATTGAATG 13 620 

TCTGCTGTAC AGTAGCACGG TGTTCTCAAA 13680 

GATTGCATAC GCAAGTACCC GATCTTCAAT 1374 0 

GCGTACCTGT TCTGCGGTGA ACACCAGTTG 13 800 
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TGAAAAATCT ATCGTTGAAG 

GCGCGCGTGT GTGACTCCTG 
ATGCGAATGA TGTCTGCAGC 
TAATGCGTGT GCTTTCTGGA 
ACAATCTTTT CGCAAACTCG 
GCGGCGGCGA AAAAATCAAT 
GCTAGGGCAT CGGCAAGCAC 
AGGAGTACCC TACGCTCCTT 
AGTGCCTGTC TTTGATCTCC 
TCGCTAAAAC TCAGCTGGGG 
GTGCGCATGA CATCCCAGGT 
GCCTTTtCGA AGCGGTGAGA 
ATTCGCCTGT GCAAGCATAG 
TACCATCTCT TTTGCCATAT 
TGCCGCGACA TTGATACCGG 
AGCGCGCTGC TTATTAATTC 
CTTTTCAGGA GAATCGATAT 
AACTGCAGTC ATGGTCCCGA 
GAACCACATG GATGCAGTTA 
CATGTTCATG CCATTGAACT 
AGAGACCTCT ACCTGAATGT 
CACACTCAGT TCGCGAATGC 
AACCTGAATG AAGGAGATGC 
CTGGTCCGCA TCTTTTCAGA 
CGCAGTCCTG AAGACAACTT 
GTTCTTTGAG AGAACATAGC 
TGGTGCTTTC AAGCGGACCA 
AGGGGTATCG GAATACGCCG 
GTGAGTGCTT GAACGTTGTG 



AGGTATCTGT 
CCTAAATGTT 
CTTTTCCATC 
GAAATGAACG 
GATCAGCGTG 
AATTTTCTTT 
TTCCTTAAAC 
CTGAGGGTAT 
AAGGGATCTT 
AAACTCCGCC 
GCGTAGACAA 
AGAACTAACG 
CAGTCCCAGA 
CCACGTCGCG 
CAACCGTGTG 
TCTTTATTGC 
TCATGACCGA 
TATACGCACG 
CAGTGTTCTC 
GAGCGTGGCT 
AGAGACGGTC 
GCTGGATAAC 
CGTTCTGCGC 
AACTGCAAGA 
CTCAATGTTC 
GCTCATGTTG 
GCCGCCCTGG 
GGTGCACTTG 
CAAATCGGAG 
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ACCGTGAGCC TCTGTGCAAA AGCCGTAGtT 13 8 60 

CGCACAGAAG AAAGCGTGTC GGTAACGTAC 13920 

TCTTTCTCTC GGTAGAGCAG CCATTGTGCA 13980 

GACTCAAAGC GGTTGAGAGA AATAGAACGC 14040 

CGCACGTCGG TCAGGTCGAG GGGCGCAACC 14100 

ATTTCTTGGA TTTTCTTCTG AGCGAGCTCT 14160 

CGCACGTCCC TTTTGCGATA ATAGGACATG 1422 0 

TTCACCAAAA GCGTGCGGTT GTAGATATCC 14280 

AGTTCAAAAT ACCGGTCGAT GTCGGCATCT 14340 

CTCAGTACTT TTTTATCAAG CTCTGTGAGT 14400 

GCTCTTTAGA AGAAAGGCTG GCGCGCAcGG 14460 

CAAGAGGCTT AGAACGCTCT GcGTAgCCTG 14520 

CTGCACCAGA ATCTGGTTCT TGGTGTAGTC 14580 

GATGCGAGAC TCAGCTGCCT GCAAGTTCTC 14640 

GTCAAGTC T A TTCTGG T AGG CACCGAGATC 14700 

CTGATCAAGC GTACCGATTG CGCGGTTGGC 14760 

CTCGTCACCT GCATCCCGAA TTCCCATGGC 14820 

CGTGCGCTGG TCCATGTTTG CACCGATGTG 14 880 

CCCGCCTTGA CGCGCGAAGC GACCAGTGAG 14940 

GGCAATGCGA TCCACCTCTG CTACCAACTG 15000 

TTCTGCGGAG TAGATACCGT TCGCCGCCTG 15060 

GTCGGTGGTC TCCTGTAAAA ACGCCTCCGC 15120 

GTTTGTAGAC GCCTGGTTCA AACCACGGAT 15180 

CCCGAAGCGT CATCCCCTGA CCGGTTGATG 15240 

TTCTGG AC GG ACAAGTTAGT GTGTCCGAGC 15300 

TGATTGATGA TCATGAAGCA TTACTCCTTT 15360 

CATCCCTGCC gTTGCACCCC GTGCTTGGTA 15420 

AGGAAAAAGC GGTGCGTATA TCTTGCGTAC 15480 

GTAGAATCCC CGTCCTGTTG ACCTCTGCAG 15540 
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CAGAGTTACC CCGGTTAGGT TCGTGCGTGA GATAGGTTGC CGGTTGCGTC 
TGCACTGTGG ATTGAGTGGC TCTGTCCTTG TTTGAGCTTG TGCGCGGCGT 
GGCGTGCACT GCCGTCTTAG CTTTCCACGG AGGGATGTGG GAGAAGATAA 
TGGGGAAGGC GTATGAGGTG TATGAAGATA CCCAGGCAAC TGACGAGGCG 
GAGAGGTTTT ACGCGCACCC GTGGGTGCTT GTTGCGGTGC TTAGCgcGCt 
TTTGCAGTCC aGcTACGCAC GCTACGCTTG GACAATAATA ATTTTCGCTT 
GAAAACTCGG TGCGTATCGC CGATCAGCGC ATCGATAGCA CATTCGGCTC 
GTGCTCATTG GTATTAAGCG TGAGTATACT TCCGTCGTTG ATCCTGTCTT 
GTGCGGTCGC TTATTGAACG CATCAGTGCG GTCCCCTTGG TGAGGGCGGA 
TCACTCCTGT CTGCCGAATA CCTTGGTCTG CGTGCAGGAA ATATTATCAG 
GTTCCTGATG AGTTCTCCGG AAGTGCAGAA GAGGTACAGG GCGTTTATCG 
GATTGGGATT TCTATGAATG TAGTCTAGTC TCGCGCGATC TACGCTCTAT 
GTGTTTCTAG ACACCTCCAA CGAAGAAAGT AGTTCACCTG AAGCGATGGC 
GCGATCATAC GCATTCTCGG TGCGTGGAAA AGTCGTGACG CTCAGACTTT 
GTGACTGTTT TTAACGAAAT GGGGAATGAG GCGTCGACGC ACGATTTAAC 
CCGCTTGTGG TGCTCATAAT AATCGTGGCG TTGTTTGTAT CGTTTCGCCG 
ATCTTCTTGC CCC TTTTGAC AGTGGTCATA TCTACCGTGT GGGCCTTAGG 
TTGTGTGCCA TACCACTTTC TATCCTTTCT GCCATCTTGC CTGTAATTCT 
GGGAGCGCAT ACGGCATTCA TATAGTTAGT GCGTATTTTT ACGGCGCCTC 
TGCTCCACCC GGCAGGAGCA TCGCGCTCGC 
(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1235 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



CGGCTGTGTG 
AGCTGTACTT 
TTAGGGAATG 
TCGGCTACTT 
GACGCTCTTT 
TATCCCCAAG 
CCAAGTTCCT 
TCTTGCGGAC 
GAGTACTCTC 
TGAGCGTGTT 
AAAACTTCGA 
GCAGATAGTC 
AGCTTGTCGC 
TGTCACAGGG 
GCTCCTGGTG 
CCTGGCGGkT 
AGCTATGGCT 
TATTGCCGTC 
CTCGCGTATC 



3041 

15600 
15660 
15720 
15780 
15840 
15900 
15960 
16020 
16080 
16140 
16200 
16260 
16320 
16380 
16440 
16500 
16560 
16620 
16680 
16710 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 

TCAGCCCGCG CACAGGAAAG TATAnGATCG GCACGTTTCC TTGCCTCCGC ATATATATCT 60 

CTTTCTAACA CcTCTGTTGA ACGCAGCTCT TCCATGTAGT CTATACCTTT GCCCCGATAA 120 

CCTGTCGAAT TGATTCACGC AGGGACGTGC GTACGTTCCC CTGTTCGTGC AAAGCGGGGA 180 
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CTTCTACGAC AAAGGGTAAC TCCCCCTGCA TCTGCCACGC ACGCACCTCC TCTCTCAGGT 240 

AAGACATAGC GCGCTGGGTG AGTATGAGGA CCTGCGCGGG ACGGTCTCCA GAGGGGGGAC 300 

GCGTGACCTG GGAGAATGCG CGCGCCGCCT CCTCTGGAGA ATGTACCACG AAACCGTGCA 3 60 

CCCCTACTAA GGAAAAACCC AACACCAGTT CTTGCTCTCC AATTATGCAA TACGTCACTT 420 

GATGACACCC CAGACGCACG TCAAAGCAAG ATGATAAGCA ATGCGACCAA AAAACCCCAC 480 

AGACAAATGC CTTCTGCAAG ACCGATAAAG GGAAgTGCCT TTCCTGAAAT TTCAGGATCC 540 

TcAyTCATTG CCCCCATCGC TGCAGCCCCG ATTTTACCTA CTGCAAQGCC TCCCCCAACG 600 

CAAGCGAGCC CCACCGCGAG TcTGCGGCAA TGTATTTTAA GCCGCCATCT ACATGAGAGG 660 

GCGGCTGmmT CTCCGCGtTa AGaAGACACG CACAAGCCAG CAAACnCACC CGTAAACCAT 720 

GCTCTTTTCC AACCCATACT AATCCTCTTG ATATCCAAAC CTAAnCGGTG CGAAGACGCT 780 

CCCACTTTTG GTAAAAAACT TTGAAAAAAA CTCGTAGTAT TGCAGCCGAA CCGCTTGAAt 840 

GGCAACGATC AACCCTTCTA GAAAGATAAT GACTCCATTC CCAAACACGT ACACGAGTAT 900 

GCCCCATAGT GAAGCGTAGC aCCAACGAAT TGCGTCATAG TAAACACCAC AAAACTTAGT 960 

ACCGCATGGG ACAAGGCAAA GGCTCCCACG CGCAAAAAAC TCATGGAGTT GGAGAAAAAT 1020 

CCCGACACCA CATCGACCAT TTCGATAACA CCGTGCATTA GATACATGCC AACACCTTCA 1080 

GGAAACCACG GACGCACACG CTTGCACACA CGCTCCAAAA ACTCTTGACA AAAA t ACCCA 1140 

CGaGAGGCAC GCCCTTGCAA CCGCATCAAA GACCCCGAAT GGATTCCAAA AGTGGTATGC 1200 

GCACTGCAAG GGCAACATGT ACCAAAAAAA GAGGG 1235 
(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16636 base pairs 

( B ) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 

ATTCTCnGCA CATGTnCCCT GACACTTCCG TAGCGGCTGC CGAGACCTGC AGCCAGAATA 60 

ACTAGCGTGA CGTATTCGCT CATAGAAGTC TTCTAGCAAA ACGGAGCACG CCCCCGGgCC 12 0 

ATCCCCGGGA AAGCGTAAGA GACGCACGGT GCACTTATGA GCGCGCACGC AAGGCCGCCA 180 

TCGTAACGTA TTGTACGGCT TGTTAAAATG CGGCAGAAAG AACATGTCCA GCAGTTTTAA 240 

CTTATCAATG GTCACCCGCT CCTGGATTGC AAGAGAGAAC AGGTGAATAC CCATCGACAT 300 
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GTCGTGCCGT GAC GCCATTT 

CCGGATTTTC ACCGGGTGAT 
ATCCGTTACC TCCACCTCAA 
CACCATTTTT AAGTCGTATA 
TGGGAATCCT GCAGCGTTGT 
CGCAATGTAA GAAGTTTGTC 
GTACACGTCT TTCACACTTG 
GGTTCGCACC TCGTTCTTTC 
CATGTCTGCG GGGTACTCAC 
ACGGAGTTTT TGCACTTTCT 
TCCATGAGTG CACGGAAGGA 
ATGAGCGTTA CCTTTTTTTG 
CCGGCACCGA TAACGGCAAT 
TCAGCATCCT GAAATAGCTT 
GGAATGATAG GCAGAGAACC 
G AAC CGTCTC GTGCAGTCCC 
CTTTCCATGG AAACGCGTGC 
CCCTCCGATC CACGGATTTG 
ATATTATTGT TACGGTCAAA 
TTGACACAGG CGGTTCCTGC 
TTACGTTTGG CATTGTAGTT 
GGTTGTTTTG TTCCCAAACG 
TTCTTCAAAA ATGAGCGCGT 
GATATTCCAC TGCGTTGCCG 
CTCAATGCGC ACCTCAGTAC 
TGcACGGGGA AAATCACCAA 
AGTAAGTTGT ATATCTAACG 
TTTGATGCTG TTTTCTGCTT 
CGCTTTGTCG TACCACGGCG 



454 

GAGCGCCCAC AATCACCCGC GTTTTTTTAT CAAACACAAT 360 

TGTCCACTTC CATAAAGGCG GGTAACTGTG AGTCTTCAAA 420 

GTCCCATGCG CGCGGCGGCT nCCTGCGTCA CTCCTGTGGA 480 

TGTTGATACC GTTGGAGCCC TGCACCCCAA TGCCTTCAAG 540 

GCGCCGCAAC GATACCGCTG CGCATCGCAT TGGTTGCAAG 600 

CGAGCGAATT GTCAAACACC GTTGCACAAT CGCCAATTGC 660 

TTTCTTGTTT TAAATCTACC gCATACGCGC CATTGGCAAA 720 

CCAGTGCAGT ATTGGGGCTA AAGCCAATGC ACACCATAAC 780 

CCTTATCTGT CACCACTGsC ACTAC CTTTC CATTGCTGCC 840 

GGCCAAACGC AAGGGTGATG TGATGGTGCG CAAGTTCTCA 900 

TGCGTCGTAA TAATTGGAAA GAC T AGAGTC CATCGCATCG 960 

GTGGCGCTCA AACGCCTCTG CAAGCTCCAC GCCAATGTAC 1020 

ATTCTTAATG GAAGGCTGCT CGAGTTTTTT AATCACCGCT 1080 

AATGCGCTGA ACATTCTCCA AATCCATGCC GTCGATTTTA 114 0 

GGTTGCAATA ATAAGCTTGT CGTAGG AC TC TGCGATTGCA 1200 

GTACACCTTT TTTGAGGCAA AATCGATACG GGTGATATCG 1260 

ACCCTTTTTT TCCAATTGTT CTTTATTTGC GTAGAATAGA 1320 

TCCGCCAATC CAAAGAGCCA TGCCGCAACC AAGGAAGCTA 1380 

GACTACCACT TCATTCGTGG TGGTAAGGTC TGTGAGGCAA 1440 

GTGGTTCGCT C CGATAATG A CGATCTTCAC GGCGTCCTCC 1500 

CAGGGAAAAG ATTTTTGTAC AGGCGCCTGA AAAC AG CCGC 1560 

CGATAACTGG GAGAATGTTA TTcTGCGGTG CAGGTGGTTT 1620 

GGAGCGCGCG AACCCGGTCT GAGTCTTCTT TTTGCACGAG 1680 

CTTGCTGTCT ATCCTGCATG GTTAGATACA GATGAATCAA 1740 

ATTGGGGCCA CAGGGTGCAC GCCTCTTGGA GCGTAAGCTC 1800 

GTTGAGCAGC AACGAGTGCA ATCAGCGTAT ACACATGCGC 1860 

CCTTCACCAG TAAGGCGTCC GCTTCATGGA GTAAATGCAG 1920 

TGTGCACGAG CATCTGCACG TTCTGCGGcT CCTCCCGGAG 1980 

CTGCGTGTTC ATAGCATCCG TCGGCGTAAA AGGCGTTGGC 2040 
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AAGAAGATGA AAAGCCTGAC CACGTTGCGC AATCACCGCC gCGTCTGCGC GCTGCGTTTC 
CACTTCAAAC AGGGGAACAC AGCaCGCTAG CGCACCGCgc TGGnCGC TGT AGTTCTATAT 
AATTTTCCAA C AC T ACGAGT TCGTGCGGTA ACAGCGCACG CGCGCGTTGT AAAGACTGCG 
CTGCAGCGTc GAAGTTTTGC TCGCGCAGAG CCTTTTGCGC ACACAGGTTA TGCAGCCAGC 
CGTTATCCGG ATCGAGTGCC AGGGCGCGCG CAAGAGGCTC GTCACAATCT TTTTCGC TTA 
AAAAAAGAGA TTCAGCGTAT TTAAAGTGGT AGAGCGCACA GTCCGGC GC A AGTGCACAGG 
CTCTTTGAAA TGCATCACGT GCGCGCTGTT CGCACGCTGC TGCAGCAGGA GCGTCGTGTT 
CGTGCGTGCC CTGCGCTTCT CTCAAAAGCA GGCCATATAA GTACCAGACG GTAGCATCAG 
CGCTCCtGCG CGGCAGAGTG CGTCAAAcTG CGTGTGTGCC TGACGGTGCC GTCCTGTTGC 
ATAATACAAT TTGCCTGCGA TACTGCGAAC TTCTGTACGC TCAGGATCAA GACGGCGAAG 
TGCGAGTACC ACGCGTTCTA AATCAGCGTA GGATTCCTGT GCTAAAAAAA GCCGCGCCGC 
ATGTAAATAC GCCTGAATTG CTTGCTCCTT TTCCCCCAGT AGGGAAAGCT CCTGCGCCGC 
ATGCAGCGCA AAGAGTCCCT GGTGAGGGTC CAAACGAAAT GCACGCTGAT ACGCAGCAGC 
AGCGTCCTCG TGTCTATTTT GTGCAAGTGC AAGGTGACCG CATAGATTGG AAAGAAATGC 
ATCGCGCTCG GCTACGACAC GGTGTGTGGC AAGGAGGTGC GCGAGTTTCT CGTATGCGTT 
TTGTTGGTAC AGCGCACCTC CTAAGGCGAG AAGTGCCCGC ATACAGTGAG GGTCTTGCAC 
AAGCGCTGCG TTAAACGCTT CTTCTGCCTC ATCATAGAGG TGCTGGGCAC ACGCGATGTC 
TCCTGCAAGA AGATACTCGC GTAAACCAAA GGTACATTTG GCTTTTAGTT TTTTCATCTC 
TGcACGAGCG CaCrCTGCGc GTc CCATAGC GTACAGGGAT TCTGCGAGAC GCAGgTGCGC 
GCTTGCAGGG ATACGCGCAG gTTCGTTGGC GCGCAAGCAT GCAGTTTCAA CCATGAAAGG 
TTGCGATTCC AGACTGAATG CTGAAAGTTC AAAAGAACGA TGCGCATGGA CACCCCAGTT 
CTGGGCAGCA AAACAAACTT TCCCTGCCGC TTGTATCGTG TCATCTTCAA TTTGCGCAAT 
CCACTGATCA TCTACACAAA GAG T AAAGC T TGTGCCAACT GCAATGAGGC TCAACACGCG 
CACTTCATCG ctGGCGCCAG TGTCAGTCCA TCCGAGCAGG GGGAGGGGGG TGTTGTTGAC 
TACCGCGTCC AGACGCAGCC ATCCACCGTC TGAAACAAGA AGCGCGTAAA AGGTGCTCTC 
ATTCAGATAA CGAAACAAGA GCCcTGCGGC GCAGGTACCT GCTCGCTCGG GCACCGCCTC 
GCCCATTGCG CTCGGGTCGA CGGCAACTGC CTCCGGAAGA CAGGCAGGCC GAGAGGTCGA 
ATCAAGAGAA GCAGGTACTT CTGGCTGTTC TGAATTAGGC AGCGCACGCG CCTCGCCTGG 
AGGGTGAGTA CCCGGGAGAA AGCGGATGCG CGCAGTAAGC ACGCAGTCCT TATAACGAAA 



1/13041 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
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GACGGGGTTA GCACTCCACG CATAGAGGAA CTTACGTCTG AGGTGGAGCG TTAACCCATG 3840 

CGCGCCACGT GCAGTCTCAT AGCCGTCCCC TGCCTCCGCG TGCCAGCGTG CATGTTCCGT 3900 

AGAAGAAAAG TCAGCGCGCC AAAACTCAGA C AC AATTTC T TCATAATCTA CAGGAGGTGT 3960 

TACACGCCTT TTCAAGAACT TTGCTTTGAA ATATTGAAAA TACAGGCGCG CACGTCCCAC 4020 

GCCCAAACTC TACGCTAATT TTCCCCAAAG TAAAAGGGGA AGGGTGAGTC ACAAGACAGC 4080 

ACACAGGTGA TCACAGGGAG TGCGCGTCTC CGGTTAGGGA AATAAGAAAT GTGGTATGCT 4140 

CCGCCTGTAC GTTTGGACTA TGGTGCAGAG GATAGGAAAA GATGCAACTG TACACTCCTG 4200 

CGCCTATCTG GTACCCTCTG GcATAGTTTT TACCTAAGGA GCATTTCAAT GGCATTACTT 42 60 

GACATAAGTA GCGGGAACGT CCGCAAGACT ATCGAGACCA ACCCTCTGGT CATTGTGGAC 4320 

TTCTGGGCTC CCTGGTGCGG TTCGTGCAAA ATGCTCGGTC CTGTTCTGGA GGAGGTAGAA 4380 

AGCGAAGTCG GCAGCGGTGT TGTTATTGGA AAACTGAATG TCGATGACGA CCAAGATCTC 4440 

GCCGTTGAGT TCAATGTGGC GAGCATCCCC ACGCTTATTG TTTTTAAAGA CGGGAAAGAA 4500 

GTCGATCGTT CCATAGGCTT CGTTGATAAG TCAAAAATTC TCACGCTCAT CCAGAAGAAC 4560 

GCC TAAGGAT ATTTCTTTCG TACGGAGTGT GCTACCAGCT C ATC AGC AAA GCGATGCAGC 4 620 

CGGTGCCTAG GGGAACAGTT ATATTGTCGT AGTCTTTGCA GGGGAGCAGT TCGATGAGCG 4 680 

CCGCGCAGAC TCCGAGCCCC CAGCtGCGCG CGCCGGAGnT CCCAGCGCGT GCACACTCAG 4740 

TGCCGTTGCT ATGCAGCACA CGGCACTGCC TACGGGTGTT TTCCCCTGCA CGGAAAAGAG 4800 

CGTCGTGGTC CTTCCCCTTC TGCGTCCGGG GGAATCAGCG GAATTTTTGA AGCGCAAGAA 4860 

AGTGTATACA GTACCGGCGA GGCTTGCGCA TCCGTCTCCG AAAGCGAGGG CGCAAATGGC 4920 

GGCTGCTGCG ATAGGTGGGG GAAAGAGAAG GATCGCGCTT GAAACACCGA TTGCAAGGGT 4980 

GAGTGGCCCT CGCACAAGAG AACGTTCGTG TTTGTCCCGG TCTCGGGCCG CCGCTCGGGT 5040 

GAGGAGTGAG ACAAGTGGCA ATGTTTTTCC ACGCAgTCTC CACCGCTCGG CAACATAATA 5100 

GCCAACGCCG AGCGTGCCAA TGGCGCCGAG CGTAAGGGGT TTGCTCCATG CCGCAAGCAC 5160 

AATGCTCAGT GCTGACGACA GGTGTATGCC CTTTCTGAGG CATTCGCTTG CAAGGCGTGC 5220 

GTGCATGCTT TCTACGTGAG CGGCGCGGCG AGTACGTGCG GCTGGGTTAG CGCCGGCCTG 5280 

TACGGCCCCG ATCCTGGCTG GATGCTCGCC CCGCCGGCAG GGGGCGGCTG TCTGCAAGnT 5340 

GTTGTTCTGG TCCTCAGCGT CCACCGGCTT CCGTCTTTTC ACAGGGGCCA GCGCAgTACT 5400 

ACGCGCGCTA CATTGAAAAA TACGAAGAGA CTGGTAGTAT CTACGATTGT GGTGATCAGC 5460 

GGTCCTGCCA TGATGGCGGG GTCTAAGCAG AGCTTTTTAG CAAGAATAGG AAGGAGCCCT 5520 
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CCGGTCAGCT TTGCCGCTAC TACGGTAATC ATCAGTGTTA TTCCCACGCT CCCACATAGG 5580 

GCAAGCGGCT TGCCATTTAT GAAGTACGTT TTCGCCAGGC TTGTTGTGCC TAGAATGCCG 5640. 

CCAACTGCCA GCGCAACTCG CAGCTCTTTG TACAGCACCC GGAGCCAATC GTGTAGCTGA 5700 

ATCTCACCCG TTGCAAGGCC TCGGATAATA AGCGTAGAAG AC TG ACTGCC TGAATTCCCC 5760 

CCGGTGTCCA TGAGCATAGG GATAAACGTG GTAAGCGCGG TGGCAGTCAC CAACAAATCC 5820 

TCGTATCCGG CGATGAGATT TCCCGTGAAC GTTTCAGACA CCATAAGAAG TAGTAGCCAG 5880 

CCGATACGGT GTTTCACTAG GGTGAAGACC TCAGTTTCCA GGTATGCTTC ATCTGAAGGC 5940 

TGCATGGCCG CCATGATCTG GAAATCCTCG GTAACTTCCT GCTGCATCAC GTCCATGATG 6000 

TCATCGACGG TGATAATGCC AATGAGCCGT CTTTCAGTAT CCACCACAGG GAGTGCAAGG 6060 

AAGCCATATT TTTTAAACAC CGCCGCGACT GCTTCTTGAT CATCATGGGT GTGTACAAAG 6120 

ATGCAGTCAC GTTCACACAG ATTCTCAATC AACAGATCTC CCTGACTAAG CACGAGCTTT 6180 

TTTAAAGAAA TGACACCGTG CAAAAACCGG TTTTGGTCAA TGACATAGCA CGTGTACACG 6240 

GTTTCTTTTT TTAATCCGGT TTCCCTAATG CAGCAGAGGG CATCGTGCAC GGTCATCTGC 6300 

TTCTCTAAGT CTACATATTC AGTTGTCATG AGGCTTCCTG CAGAATCCTC TGGATATTTT 63 60 

AAAAGTTGAT TGATAACTTG ACGTTCTGTT TCACCCGTTT GTGCGAGAAT GCGTTTCACA 6420 

GCATTTGCAG GCATTTCTTC TACCAAATCG ACTATATCAT CCATGGCGAG TTCTGCAAGA 6480 

ATAGGTGCAA GCTCTCTGTC TGTAATGGTG GCGAGAAAAG CAGATTGTTT GCTGCTTGAA 6540 

AGCTGCGCAA ACACATTTGC AGCGAGGTTC TTGGGCAGCA TTC TAAAT AG CAACAACGCC 6600 

TGTGCAGGTG ATTGCATGTC CAAGACATGC GCGACGTCCA CCTCG TTC AT CTCGTTTAGA 66 60 

TTTGCAATAA GTGGTACGTA ACGCTTGTGC TCGAGAAGCG TCTGGATTTT TTCAAAGTTC 6720 

TCGTTCATAG CCAATTCCCA CGCGGAGTTc CGGAGTATAC GTGAGTTCAC CTCTGTTCTT 6780 

CCATAGGTGC ACGTCCCGCA CGAAAGTGAC TGCTCCTGCT CCCAAAGGCC TGGCGCCACC 6840 

TAGGCAAAAC GCACAAAAGC AGAGAGGGAG CGAGAGACGC GTTCGTTCAG GCAGATCAAT 6900 

CGATGGAGTT CAAAATTCGA GTGAGCCTGT TTTTATCGTA TAGATCCATA GGTCTGCCCT 6960 

CTGCAAAACG GCGCGCCTCA TCAGTAAAAG CACCGGCGGT CATGCAAATG CCCTTGCCAG 7020 

CTTTTAGCTC TCTGATACGT CCATGCAGAT CGCGCAAAAC GAGTTCCCCT ACTGACCCCT 7080 

GAGAGCGAAA AAAACGGAAC AATACGAGGT CGGCCCACTT TGGCGTGTCA ATTTCAGAAA 7140 

CTATATCTGT GTGAGTATTC AGAACCGATA TGTCCAAAAT CTTTACCCGC GCGTGCGGAA 7200 

AGTACTTTGA TACCACCTTT CTGCATAATC CAATGAACTC ACTCTGAGCC GCTATCATGT 7260 
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ACAGGTGTAA ' GTTTCTATTC TGATTTAATT CCTGATAGTA CGTAATGAGC GC CG TG AC AT 
CCCGGTACTC AGGGTTAAGC TGGTGAATCT CTCTAAGCAT AACCAGGGCT CTTCCCAGGT 
TCTGGGTTTT TATCAGTGTC TGCGCGAGAC GATAACGCAG CTCGTTTGCA ACGTCTGAGG 
GTATATTGTC ATG CTTCAAC CCAATTTCAA AATCCTCTGC AGCGTCCTCC AGCTGGTTCA 
GCTTTGTCTT AATGGTCCCT GCATACAACG CAGCCCGCGG CCCAACGAGA GGATCAACCC 
TCAGATGATT GAAGATTTTC AATGCACGGT CG TG TG C ATT C GC C TC AT AG AAGCACTCTC 
CCATTACGAA CAATGTTTCT TTGTCTTCAG GTTGGAGGTC CAGCGCCTTC TTCAGATAGG 
GAAGGGCTTC CTGATACTTT TGCAGTTTTT GAAACGTGTA ACCCGAgcaG s TGCGCTGCA 
GGTGCACTAT CCGGTTGCGA TGTTAGCGCT CTTCTCAAAA GAGGAACGAC CTGCTCATAC 
TGCCGGTCGA GGTAATAGAC CCTTCCCAGG TTGTAGTTTA CTTCAAAATG GTGGGAGTCG 
ATGCCTTGCG CAAGGAGCAG TCCCTTCTTT GCCTCCGCGT ACTTTTCTGT TTTTACGCCG 
CAAATCCCGT ACCGGAGTAG GATCTCAAAC TGCTCCATCT GTGAGCTCGA GCTGcTGCGC 
TCCACGAGTG TCGCGTACAT CGCGAAGGCT TTTTCCCACT CCTGATCCTG GTAGTGTATG 
TCCCCCAGTA CCGAAAGACC CCGAATGTCA TACGGGTCCT GCGCAAGCCG TCTAGATGCT 
TCCCTCATGA GAGACTCTTT GTCTTTTCTT CTGCTTGAAC GCGTCCCCAC CTTGCTCGCT 
ACCGTGGCAA CAAGCATGAT ACTCGAGAGC GCAAGCATAA CGACAAAGAA GACGATAACG 
GCAGAACTCA CTCGGCGCAG TGTGCTAACA AATCATTCAG CTGTCAATAG ACTCTGCAAT 
GTGCCGGTAG gCTGGaAACA GAACTTAAGA AACC TGACTG CATATCTTTG CAACACCTGC 
GCGCCACTCC AGCAGTGTAT TTTTTGTTTC CCACTCAATG AGCCGAACAA TACTAAAGCA 
GGTGGTACTC GCTGGGTTGG GAAAAACAAT AACTGCAAAT TTCCCTTGGC CAACATACAC 
GACATGTTCG TGCGCAGCCA AACACTCAGA CTCCCTTAAA AAACTGAGCA CACAGTCAAA 
ATGAACTGGT CGAC CGGAAT CCTTGTCACC AAGAGCTACG GGAAGATCTT TGATAACCTC 
TTTGCAACGC GGACACACAT ACTCGCAAAA CTCCAGCTTT GGCGAAAAAA CATCACGCCT 
CGGCGTGCGC TTC CTTTTTC TTCTACGGCC GCTTGACTGA ACGTCAGTCA CACTCTATTC 
CTCTCATTGA CAATAACCGC TTACTGAGAA TGAGCGCGAG CCTTTGAATA GCCCCCTCAA 
T AT AGG AC TG ATCTTGAATC TTCTTTTCCA GTTCGTGAAT TGTTTCAGAG CAAAACACcT 
GCGGGAATTC GTCTTCCTCG AGAAGGAAAA ACGGAACCTT TCATACCAAA TCCTCAAAGG 
CATGTGGCAA GTGCACGATG TCCACATCCT GCCTTCGAAA AGGATCGGAC CGCAGCACGA 
TGACGTCAAA GCGCGCACAC ATGTGGTTGT ATTCTCGAGC GCTGGCAAGA AAATGTTTAG 
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CGGTTTCACA GATCCTTTTC TGCTTGCGCT TTCCAACAAT AATTGCTAAG TCGGCGTAAC 
TGGTACACCG TAGCGTCTTC ACTTCTACGA ATACTATTGT GTCATCCtGc TGCGCAATAA 
TATCAATTTC ACCTGTTGCT CTGCGCCAGT TTCGTGTGAT GATAATATAT CCGCGCGTCG 
CTAGCCAGCG CGCCGCATAC GCCTCTCCAA ATGCACCGAG TAACTTATTG TGCTTTGGCA 
TAACTCCACG GACTCACTTC TCCTACAATG TAATTCGTTT TATTGACCGC TAAATCAATG 
AACTCACCGC TGTCGTCAGG ATGGGTTACG TCGTGCTCGT CGATAACTAT TCTTAAACAC 
GGACGCAACG GCGCCGTGGC CGTGGTCTTT ACCACGCGCG CAACTGCTGA GTTATTCAGG 
AGCACCAACG ACCCAATGGG GTAAATACCG ACGCTTTGAA TCATTGCCTT GATTACGTCT 
GGATCAAAGC GACGTGCGTT GTCAGCAAGT AAGCTTTTCA TTGCTTGATA GCCACTGAAC 
GGTTTGCGAT ACGACTTTGG CGCAAGCATT GCAGCAAAGT TATCAGTAAC TGCAAGAATC 
CTCGCACCGA TGGTAATTTT GTTTCCAGAA AGAGACTGGG GATACCCTTT TCCATTCCAG 
TGTTCATGGT GCTGGAGTAC ACTCAGTCCA ACCGAGTTCG GGTATTTGAG CGTGTTTACG 
ATGTAGGAAT GTGCGTAAAT GGTGTGCGCG TCAACTGCCT GCTGTTCTTG AAAATGCAAT 
CTCCCCGACT TCTTCAAGAT GTCGGCAGGA ACATGCTGCA TACCGATATC GTGCAGGAGT 
GAGGCAACGA CCAAATCAAA TATATCTTTT TCAGAAAAAC CCAAATGCTG CGCAACGATA 
ATGGAAAAGA TAGCCGTATC TACTGCAGGT TTTGCAAATC GAAATCCTTC GATTTTGTAT 
GACAGCACTA AAC TG AC AAA CCCGAGTGTA TTTGCTCGAA CTAATTCTGA AAGGCG CTTT 
GCAAGCATGT CCGCAGGCGC GCAGGAAGTG TCGTGTGCGC ATTCATCTTG GCAAAAAGCA 
TGTTCAATTC CTGAATAAAG CCAACGTATT CTTCATGATA GTGGGGATTG ATACACACCT 
TAGGAAGGAG TTCACAAATA TCCTTCAGGA TATCTTGAGT ACGGAg TTCT CCGGTGGAAA 
CGATATCGTC TTCAGGATCA AGCAGTTCTT CTGCTGCAAg cTCTTCAAAT TCTGCTAATT 
CCTCCACGGT AGAAGAGGGA ACTGACTCCC CTTCAGCCAG CACCCTACCT GCGGTCACAA 
CGTAGGGAAT ATTCCAATCC TGGAGCACCG TCAGCTCTCG AGTGCTTACC GGCTCCCCTT 
CCCTGAGGAG GAGGTTTTCT CCGTCGTCGA AGAACACGGG CTCGGAAAAA CACATTCCTT 
CTTGCAGTTC AGATACATCG ATTTTTTGTG ACATGGACCT GTACCTCTTT ACCCGCCTTA 
TCTTCGGCAT CGGTGCGCAC GGGTTAATAC CACGTCTTAT GCACACACAG CGGTAACGTT 
ACTGTTGTGT GTACAAAAAG GCAAAGATTG CAGAGACAAC CTGATAGGTT TCCGGTGGGA 
TACACGCaCC TATCCTGTGC TCAGACAGAA CACGCGCCaG AAGTTCGTCT TGCACCAAGG 
CGATATCAAA CTTTTTTGCA ATTTCAACAA TTTTTTCTGC AATGGCGCCC GTGCCCGAAG 
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CGACAATAAT GGGCGCCTTA TCTCCCGTTG CATAGGAGAG CGCAACCGAG CACGCACGCT 
TTCTTTTCAT GGTGGCGGTA GTGTGCGTCT GCTATACAGA AACGTCAATC CCTTTGAAGG 
GGACCGCATC GTTTGCCGAG TCTGCATGCG CACCATACCG TACGGATAAA AAATCAATGC 
CsCGTTCGCG AAGGAGGGCA CACAGGCGTG CAACTGTTTT TTTCtGscTG TGCGCGACAG 
TTCCCGTGCC GCGTGTCTTT GCACCGTGAG TACGTTATTC TGCACGCAGA AGACCCATTC 
TCCTGCAGTG TTACACGCAC GTACGCGCAg cTGGgTGCAT GTTTTTTTGT GGAGGTGTAA 
TAACAGCGAG AGAGTGCCGC GCCACACGGT GTGCGCGCCG CgCCGCTCAA AGGGTACAAG 
AAGCCAGTGG AGTGCTGAGT GATATGTGTG GTTAACGAGC GCAAAGAGAT CTCGCTCCGT 
GTCAGAGTCT GTGTATGCGC CTGATTCCCC AAGAAAGGTG CTCAGCATAC GGCGGAGCAT 
CGCTGCATCA ACGGGAATGT TTCGGTCGCC TAAGATACTC GCAAGGAACG CGGCGTGTGC 
TCTTTTTTGC TCAGGAAACT TTTTTAGCAA AAGCGCAAAG CGTTCAATGA GCTGCGGCTG 
CAGCGGCACA CACAGGGATG TATGCGCGTG GATAAGCGCT GCAGCTTCAG GAGAGAAAGA 
AACACCCCAG CGCTGCAAAA AATGCGC CG A C ATATCCTC A GCAGATGGAG GAGTGCTCGT 
GCACTGCGGG TGCAGGAACA CCGTACCCGC GCGGATAGAC GCGCGCAGAA ACAGGACTGC 
GCCTTTGTGC ATAGGCTGCG GTACACGCGC GCGCACCCGT TC AC CG TT AA TGGCGACAAG 
CGCACGGCCG GCGTGCGTGC TGCTGAGAAT GCGG AC GC AC ACGAGCGCCC CTTCAGTAAG 
GGAAACCGTG CGCGGCACTT CGGTGAGTAC TACCCGAACA GCTCCGTTCA CGCGCTGGGC 
GACGGCTTTT TAACAATACG TGCTTTGATA CGGGCGGCCT TTCCTATTTT TTCTCGGATG 
TAATAGAGCT TCGCGCGTCG CACCTTTCCT GC ACGTAc T A CGTCGACCCG CTCGATACGG 
GGGGAGTGGA GCGGGAATAT ACGCTCCACT CCAACGCCAT AGGAATTTTT GCGCACCGTA 
AAGGTGCGCC TGACGCCGCT ATTTTTAAAA CACAGAACGA GCCCTTCGTA AGCTTGGATG 
CGCTCTGTTT TTCCCTCCAC TATTTTGAAA TGCACACGTA CGGTGTCCCC GACGCGGAAC 
GTTTCAGCTG GTTCCTTTCG CTGCTGGTTT TCAATTTGTT GGATGAGGTG GCAACTCATA 
GTCTAACTCC TTAAGAAGGG ACTCAGCCTC TTGAGTCCAG GCTGCAGACG CACGCGCAGC 
gctGaGGAGG TCAGGTCTAT TCCTTCGTGT TTTTTCGATC TGGCGCGCAA gcCGCCACGT 
GCGGATATGC GCGTGGTGAC CGGAGAGAAG TACAGGGGGA ACGTCCCGGT TGTGAAAACA 
GCGCGGCCTG GTGTACTGCG GGTACTCCAC GAGACTGTTT ACAAAACTTT CCTCCTCGAG 
AGATTCATGG CGGATGACAC CGCTAACACA GCGGCTCACC GCATCGATGA GCACGAGCGC 
GGCGATCTCT CCTGAGGAGA GAACGTAGTT CCCGATGCAA AtTCGTCGTC GACATACTCG 



10800 
10860 
10920 
10980 
11040 
11100 
11160 
11220 
11280 
11340 
11400 
11460 
11520 
11580 
11640 
11700 
11760 
11820 
11880 
11940 
12000 
12060 
12120 
12180 
12240 
12300 
12360 
12420 
12480 
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TCTATGATAC GTTGGTCAAT TCCCTCGTAT CTGCCGCAGA TGAGCACGAG GGCACGTTCC 12540 

TGTGCGAGTG AGCGCGCATA GCCTTGCTCA AAGAGCTTTC CAGAGGGAGT GACGTACACG 12600 

ACGCGCTTTT TGGGAGCGTC TACTGAATCC AGGGCCTTCC cTAACGGTTC TGAGCGCATG 12 660 

AGCATGC C AG GTCCGCCCCC GTAGGcGGGG cGTCACAGTG TTTGTGTTTG TCGTGCGCAA 12720 

AGTCACGGAT GTTGACAATA TTGTAGTGAA TGATCCCGTC GCTCACGGCG CGCGCCATTA 12780 

TTGAGGTGGA GAAATAAACC CGCGGGATGG CGGGAAAGAG AGTCAGTACG TCAATGTTCA 12840 

TTCGAGAATC CACCGCTGCA ACAACTCGAT TTTTTTTCGA CCAACGTCCA CGTCCCCAAT 12900 

GAAGGTCCGG TGAAAAGGCA CATAGCAAAC GCCACCATGT GTCCTTTGAA CCTCTAAAAG 12960 

GGAGCTACCG CCCCCTTCGA CAACGCTCAA GACAACACCC ACCGCCGAGC CCTCGAAAAC 13020 

GAGTTCACAA CGACACAGAT CGGCCAGGTA AAACTCCCCA GCGCTAAGCG GACAAGCCTC 13080 

GGCaCgCGGT ACCCGCAGCT CTGCTCCTAC AAACGTCCGA GCGCACTCTA CCGTATCTAC 13140 

GCGGTGGAGC TTGAGCAGCG CGTcCTGCGC ACGTAGGAGA ACGTGCTCTA CCATGTGGAC 13200 

GGCCTCACGC GGGAGAGCAC AAGCGAGGGT GCCTGAGGAT CTGCTCCGTG GAGGAGCAAG 13260 

ACAAACCTGC TTTAGTGTGG CAAGATGTGC ATACTCACCC GAGAAGCTCT TGAGCCTGAG 13320 

TAAACCCGCA ACCCCAAAGG TGCTCACGAT GCGTGCAGTC ACAATTCTAT CCATAACCCA 133 80 

CCACACACCG CCTGCAACAA GGCCGCAAGA CGGAAAAGGA GCCGCTAGTC GATGATCTCT 13440 

AAAGCGTAAC GCGTCTGAGA AGCGTGCGCA GACGCAGAAA GGAGCGTkcG CAGAGCGCGC 13500 

GCAATTCTGC CGTGCTTGCC AATG AC CTTC CCTACATCTT CAGAGGCAAC ACGTAACTGA 13560 

AGGATCTCCA ATCCCTCCCC TGGAGACTTG GTGACGGTAA CCTCCCCAGG ACGATCCACA 13 620 

AGCGCCCGCG CAATATAGGC GATTAGCTCT TCTTCCATCG TGACCATCCC CTTGCCTAGA 13680 

CGCCCTGCCC CCCTGGGGAA GAAGGGATTG GAGCGGCGCA GGAAACGGAC TCTACGTGcG 13740 

CCAGATCGGC AGCCTGCTGT GAAGAAGCAA CACGGCGCTC ATCTGAGGCA AtGCGTTTAG 13800 

GACAGAACCA CGCCTGGACT GCAAGAGCtG CGGACCGTAT CCGAGGGCTG gCGCCGCGyT 13860 

CAAGCCAGAA GCGCGCACGG TCAAGGCGGA AAGACACCTC GGTACCCTTT GGGGCTATGG 13920 

GCTGGTAAAT ACCCAGTTCT TCGATTGCCC TGCCATCTCT CGGCTCGCGC GCGTCCTGAA 13980 

CTACGATTCG GTAGTACGGA CGCTTCTTAC TCCCCAATTT TTTCAGTCGG ATCCGTAAAC 14040 

TCACTCTGTC TCCTCCGCCT GAGACACACG CaGGTGCGCT CATCCTTTCT AAAATTATCT 14100 

GTGCTGTCAA GTGTCTGAGC ACGTAACGGG ACATGGAGAA TAGATTACAG AGGAGCGGCA 14160 

CGTGACGCGT CATTTCTGCG CTGTAACGGT TGTATTGGGG GAGAAGGAAA AC tGCAGTGC 14220 
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AGCGGGTGTG CCTTTGTCTT GGGTGTTCTT ATTCAAAAAT GATACGCACC 
AGGTAGTACC TCGCTCAAAC TCGTGGGCGA AAAGTTCAAA ACTCGCTGCC 
ACACGATCAC GCCTGGGCTT TTCTGCCGCG CCCGGAGATC CTGCAGGAGT 
AAGTAAACGG TCCGTAAAAA GGTACCTGTG CTGCATGAAG AAGTGGTTGC 
TAGCACTTyC TGCAAGCAAG TACAATGCGT GCGCCTTGGC TGCTGCCTGT 
GGTAGTCTGC ATTCTTATCA GTGCCGCCAA CGATAAGAAC CACGCTTTCA 
CCAGCGCTGC AATTGTTGCC TCAGGTACAG TGGATGCAGA ATCGTTATAA 
CCCCCTTTTC GTAAAAAAAC TCTAGTCGGT GTTCGATGCC CGTGTAGGAC 
GTGCAAGACG CCGGGTGTGC TCTTGGAAGG GACTGTGAGC GGACGGACAA 
GGGGGGACGC GTGATTCnCG TACGCGGGGG AATGGGAGTG TGCGCAAAAA 
AGGAGGAAGG AGG TAG AG AA TGCCCTGCGC GAACAGGAcG CTGCAAGCGC 
ACTTGCGTTT GGAGGACACG GCCCGGTACG TGCAGCTGCG GTGGGATGAG 
CGGTCACCTT CTGCAAAACG TGCCCAGTAG GTTCCGTCCG TCGCTCTCCA 
TCCATGAGGC GCGGGGTGCA CGCGCGGCAA GCGGTCTCAG GCGACTGGGC 
AAGACGCGsA TCCGTTTTTC TGCkTcGCAG GCAAAGCGGG GTCCCCACCC 
TACACAGCAG TGTATCGTGC GTTCCCTGGT GTGCGTATAG C AC C TGTTTG 
AGCTTTCCAT ATCCGCATAC CAGTTTTGAT GGTCAGCCAT AATGGGAGTC 
TCTCCGGGCG CAGCAGACCG GCGTGGTGTA CAGTGTGGTC CTGTGCATCG 
GGTCTGCAAG CTGCCAGCTC GACAgTTCCA GAACCACTGG TGTTGCAGGC 
GCACAAATTC CAGCGGGCTG ACTGTGCTAT TCCCCCCTAG AAAGGCGGGG 
CACGCAAGCT GTAGCACAGG GCG CTGGCAG TGGAGGATTT TCCCTTGtGC 
TAGCAGCGGG GCGGGAGAAA GGCGTAGGAA AAGGGAGATA TCCGTTTCgA 
GCGCkTTGAG CAGCGGAAAG GTAGATGTTG TGTGCACCCT TCACGATGGG 
ACAACATGCG CGTTTTCAAA ATCTTCCAGC CGGTGTTCAC CGAGCGTAAA 
GGGTACGCAC GAAgTCTTTT CAGGGAAGGG GTAAGCGCAT CAGCATTTCG 
ACCGTAAGgC GCgCTCCCGC TTCTGCACAA AAGnCAG sTG CCGCGCAgcC 
ACGCCGAGGC CCATGATGGT TACCGTTTTG CCTTGAAGAA GTGCGCGCGC 
ATGCGGCCGA TTGTAGCCcG CGCAACGCGT GACAATAcAA GAGACGCGTG 
GGCACGGTTT CTCTTTACTT TTTGTTGCTT TTTTACTACC CTCGCGCGCT 



PCH P8/13041 

TGGGATTTAA 14280 

CCAGGAGAGA 14340 

ACCTC AAGGG 14400 

AACCGTGCCG 14460 

GCCAAGGGTT 14520 

TCAAAGGCTT 14580 

AAACGCAGTC 14640 

TCCAGCGCTT 14700 

GCGTAGTCTG 147 60 

CACGGGGGGC 14820 

TGCACTCGCC 14880 

CATGC AGGCG 14940 

TAGAGCTCTC 15000 

CGTATACCAA 15060 

GTC ATCTGC t 15120 

TCTGCCACGT 15180 

ATGATGGCAA 15240 

ACTGCGCGTA 15300 

GTTGTGTGAC 153 60 

AAACCCAGCG 15420 

CGCTTACTGC 154 80 

tGgCGCGCCG 15540 

ATTTTTGATG 15600 

GCGGATGGAC 15660 

CAGGTCGGTA 15720 

CCCGCCGTGC 15780 

CTGCTCCACG 15840 

CGGTGCTCGC 15900 

ATCTGCTTAT 15960 
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GGCTGAACAT ACTTC CTGTA CGAGCATTCA TCCTCTTGTG CGCAGCGCGT TTTACGCCGG 16020 

GGGTGCGCAT GCAGTACTGC TTATTCATGG GTACATGGGC ACCCCGCGCG AGATGCAGTT 16080 

TTTAGGTCGT GCGCTCCACC GGGACGGCTT TACGGTCTCT ATTCCCCGTT TACCTGGTCA 16140 

CGGTACGAAT AGAGAGGATT TTCTTGAGAC CGGGTGGAGG GATTGGCTGC GGCGCGTGTG 16200 

TGATGAGTAC CGTGACCTTT CCGCTGCGTa CCtTCGGTAT CTGTGGGGGG GCTGTCCATG 16260 

GGAGGTGTGC TGACTGCACT CGTGGCGGCG CGTTTTTGTC CCCAGAAAGC TTTCTTTTGT 16320 

GC AC CGGGTT TTGCAGTTTC TGATTGGAGG ATAAAGCTGT CTCCTCTAGT CAGGTGGTTT 163 80 

GTGCGTGAGT TTGCTGCGGA CGCGGCTCCC TTCTACCCCG AGCAAGACTT TAATGACGCC 16440 

ACAAAGGATT ACCGGAGTGC GC ACT AC ATT GCCCAGGTGG CGCAGTTTTA CGCACTGCAA 16500 

AGACGTGCGA TCCGTTCGCT GGCGTGCATT CGGAGTACGT TGTTAACGAT CCTGTCTCGG 16560 

CAGGACCCAT TGGTGCCGTG TGCAGCGGTG CAAAAATTAC TCGATGCGCG TGTGCGCACG 16620 

CACACCAGTA CGTATG 16636 
(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13330 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 

TGATAAGCCC AGCGAATTAA TAACAAATCC TAAAAAAAGC GTGwArCCGA AAAAGGC C AT 60 

AAGCGTGTGC AGAGAGCCAT GAGAAGGCAT TAGGTGCAAG ATGTGATCGC GTGGTACTCC 120 

ACCACATAGC GCAGTAACGC TGCGAATCAA CGGAGCGAGG AGCGTACCAT GAGCGAAAAA 180 

CTCCCCCGTC AGAAAACCCA TCACCATGCT AGAGAAAcCT ACGGACAAGA ACACGTAGTC 240 

AAGGTGTGCC CATCTATTTA GCGCACGTAC ACGCCGCGTC CGCAGTAGCA AACCAAGTAC 300 

GAAAAAGAGG AGACCCTGCC CGAGGTCCCC AAACATAATC CCAAAAAGCA GCGCATAGGA 360 

GAAAGCAACG AAAGGAGTCG GATCGACGAG CCCGTAGGGG GGACAACCAT AACTAGACAC 42 0 

CATACGCTCG TAACTACGCA CAAAACGGCC ATGCTGGTAA CACACCGGCA CATGCTCGCT 480 

GCCATCCCTG ATAAAAGACA GCTCCTGTGG TTCAAACAGG CGGACTGCCA TCCTCCCTGT 540 

GGTCACGTTG TCCAATCCTG CAACGAGGTC CTTCGCCTCA TGTGCTGGCA ACCAGCCAGC 600 

TATACGATAG GTATGCCGGG TAGACTCAAG CGCATCACGC GTGcGGTGcA CACGTTCCTG 660 
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CAGCGCAAAA CGCCTAAGCA GCGCACACAG 
TTTCTCACCC TGaAACTGCT CTTCGTTGGG 
GGGGAACTGA GGGAGCATCA CCCTCTCGTT 
GTATCTGCTG TGCCAACGTG TAGTCTTCTT 
CGCACTCGCC AGAGACACCC AGGTACTTAC 
TGCTCTGCGC ACAGTGGGAA GAACTGCCAC 
GCGCCGTCTT TCCCAGGTAC TCCAGTACTC 
CGAGGAACTT CATCCTCTGC GATCTAAACA 
AGCGTAC GCG CACTCCCTGc ATGTATACAT 
AAGCGCTTCA TCATAAAAAA TGgACGATCG 
CGTGCGCGCA AGACAATACA GAAAACGGGA 
CTCCCACTGT ACCCCATCTA CAGGATTATT 
ACGCCCCGCG C TC AGTAAGA GAACAATCCC 
CCAGACACCT GCGTATCGCA CCGTTAGAAG 
CACCATAGAA CATGCGCAGA CGAAtACCCA 
CACTAACCGG GAACATGCAG TGCGATCATC 
CGCATGGTAG TAACTTTGAT CTAAGCGCAA 
TACGCGGTTG T AC C AGGAAA CTGGGCTCCC 
TTCCCAACGA AAAAGGGAAA AACGACCCAA 
CTGACTCTCA GAAGAAGAGA GCGTCTTCAG 
GGAAAAAAGC GAAGGGGGTT CCTGATAGCA 
ACACGCCTCC ACCCGCCGCT GGATCAGACC 
AGGCTCAGAA AAAAGGCGCA CCCAGAGATC 
CAGGCGCTCA CCAACCCAAA GCCGAGCACA 
ATCCGCGACA TCTCGCTCCA CAGCCACACA 
TTCGAAGCGC TCCACGTCCT CCTCCAGCAC 
CTCCCTTCTC TGAGCGTCTA TCCGCGCACA 
AACCACGCTC CCCTCCGCTT CCTTCACCAA 
CAtCCtGCGC CTGCTCtTGC GCCTTCTCCA 



464 

TGCCGGGcCT CGCATCGAGC 
CAGAATCATG GGCTACAGAA 
CGCGTGCGTG CAAAGCGTcA 
CGGTAGGCAA AGAATCCCCT 
ACGCTTCTTC AAGACGGCCG 
GCGCCGCGGC GGAAAgACGC 
GGTCCACATC CCGTTCAAGC 
TAAGCTGCTT CCCCCAAGAA 
TCTGCAAGCG CACAGATGCC 
TCGcAGGAGT AAAGwrtGCG 
GCCCGCACGT TCAAACGCGT 
TAAGAAAGCG CTCATAGCGC 
ACGCAAACAG GATGCATTTC 
AAAGcTCCGA AGGwrCnTyT 
CCCAACAGCG T AC AACGCC A 
TTCGGGAATA GAACACAaTG 
ATCCCACAAG ACACGCTCAT 
CCGAGTGACA GCCTCGATAC 
ATCTGGCGGT TGGCAGACGC 
GTAGAGATAG TCGTAACGAG 
CGATGCCGCA CGCACACAAT 
T AC C AGATGT GCACCAGCAG 
ACACAGCGTC CCCACACCCT 
CAAACCGGAT AGTTTCGCAT 
CTACCCAAAA AAGAAACCAT 
AGAAAGCTCG CGTTCTAACT 
GGCCTCATGC CACGCCTGCT 
GGTCTCCCCT GCCGCCTCTG 
CGAGGACAGC GGCCGCATCC 
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CGAGCGCACA 720 

CTCTTCCCAC 780 

ACTTCTGCAA 840 

GGAGAAAACG 900 

ACATACTCTT 960 

AAATGGACAA 1020 

ACTACAAGTT 1080 

CTCACGTGGC 1140 

aCGCGCTTCA 1200 

CATGAAAAAC 1260 

CTAGATCCGG 1320 

CATCCACTCC 1380 

CACCACGTCC 1440 

nTCTTCTGTA 1500 

CTTCTTC ACG 1560 

TCTGCCACAA 1620 

CACTCCGAGG 1680 

ACGGCCACTT 1740 

GAAACCCCGC 1800 

CAAGCAGCGC 1860 

CGCGCAGCAG 1920 

ACCTAGGGGC 1980 

GCAGGGTACC 2040 

AGACGAACAC 2100 

CAAGGAAGGC 2160 

GTCCACACGC 2220 

CCTGCGCCTC 2280 

CAGATTCGCG 2340 

TCCACCTCCG 2400 
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CTAGGCGGAC CATCACCCCT TTATCCATAT TACCAACCCA CGCGGCCGAC ACTCAGCGAG 
CACATCGACC TACTCCCGTA CCTCAGGGGA CGGAGGAACC GCTACCCCAG AGCCCGAGGA 
ATTCCTGCGG CAGGACCAGT AGACTCAGGA GACGTTGGAG CACCAGCACC TTCTTcGGAG 
GkTGCTTCCA CCACTCCACa GCcGTCTCGG CTTGCTTcTg TGCGCCGCTT TCTGCAAGCC 
CGTGTCGTCC GGAGCCCGGT TCAACAACGC GAGAAAGAAA GTCAAACCAA AAAAGAGCCC 
TACCATAACG TAGGAAGTCT TCGTGAGGAC CGAAGCGGAG CGCGAACCAA AGGCAGAACG 
CGAACCGCCC GAAAACATGC CACCAgCCCA TCTCCCTCTT CAGTCTGCAA GAGsATAGCG 
TGACCACCAG AgGCAAACCA CCACCAGGAG TGAGAGTATC ATAACGCTCA GGACAGCCAT 
GCCTCATGCT ACACCAAAGC GTGCGCAAAA ACAAGGACTT CAACCTTTCA CTCCCTGTCC 
CGAGAACAGA AAAACGACCA TGTGAACGGC ACAAAAAAAG AAACCTGtTA CGCAATACCG 
CACACCACCC AGAATCCTTT AC TACTCTCT ACACtGCGCG CGATAGGAAC AAAAGACGCA 
GCCTCCAGCG AAcACCGCCA ATGAGTCCCC CGTCAATGTG CTCTTCAGCC AACAGTGCCC 
GCGCGTTCTC CGCTTTCATG GATCCGCCGT ATTGAATACA CAGTGCCTCT GCGATAGCCG 
CGCCGTACAT CTCGCGGACT ACTGACCGAA TATGAGCATG AACCGCATTC GCCTGTGCCG 
GAGTGGCAGT CTTACCCGTA CCAATTGCCC ACACAGGCTC ATACGCAACA GTTACATTAT 
GCATGAGTGA CCCACACACG TCTGCCATCC CTGCGCGCAC TTGAGTTCCC ACTACCTCGT 
TGGTACACCC CGC TTC AT AC TCTTGGAGTC GTTCGCCGAC GCATAAGATG ACGCGCAAAC 
CGCTTTCTAA CACGCGTCTG ACCTTTTGAT TGATAAGCTT ATCATTCTCC CCACGCCCAT 
GACGCCGTTC GGAATGCCCC ACGATGACTA CCTGTACCCC CAGGTCTTCG AGTTGAAGGA 
CGGATACCTC TCCAGTATGC GCCCCCCACT CTTCACTACT CACGTCCTGC GCGCCAAGAA 
GTACGTTACT TCCCCGTAGC ACCTTCCCCA CCGCGTCTAA AGCGGTAAAA CTCGGCGCAA 
TCATGTATGT GTGCGGACCA CCCCGTAATT CCCGCACGAG TTCCTGCGcA AGGcCACCGC 
CTCCGCACAC GTTTTATGgC ATCTTCCAAT TCCCCGCGAT AAAATAGCCG CGCATATCCC 
CTCCTTAGCA CATcCTTTCG TTCACACCAC AAACACCCCG CCGATAGCTC CACGAGAAAC 
TGTTC TAT AG AGCCACCGCA GAGACATACC CTCGCCAGGA GATCTCGCCC CAACCCAAGG 
GAAAATCTCA CGGCACCACT ATATCCTATG TTTCAAGACA CGAAATGCCC GGTAAAACTT 
TACCCTCAAA GAGCTTCAGC GATGCGCCAC CCCCCGtAGA TACATGGCTC ATGCGACTTG 
CAAGCCCAAA CTTGCTGACT GCTGCAATAG AGTCTCCTCC ACCAACTACC GACGTAGCAC 
CCGCATCCGT CGCCTCTGCT ATCAACTGCG CAAsACCCGT GTACCGTGTG CAAAGGCATC 



2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 
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GAACTCAAAA ACCCCTACCG 
ACGATACTGC TCAAGCGTAC 
CACATCGTCC ACCGcAACCG 
GACCGGCAAT aCCACCGaCA 
GTCGATAAAG TCATCCTCCA 
GGTGTATGCC ATCCCTCCCC 
GACTGCTATC TTAGAAGATA 
GTTGCATACC ATAGGTTCCA 
GCGACGCATA AGTCTCGGGA 
CGCATCATTA ACAAAAATGT 
CTTCGCATCA CCAGATGTTT 
ACsGTCGGGC AGCCCTTCAA 
CGGCACCCCC AACTTCTTTG 
AATGAATGCG TGGCGGTCAA 
CGCATCACGG GTAGGGTCTC 
GATGATGTAC CGCAGAGTAG 
ATCACGCATC GGTACGTTAA 
ATCTTTACAA GTTCTCAGCA 
ACACGTTGCG GCTGATTCTC 
ACGCCTCACA TCACTGCTCC 
ACCCTTCGTC ATTGTTGACA 
CCCTGTAGCT CAGTTGGTAG 
GCGCGGGGGA GTGATGTTTT 
CTTTCACGGG TTGTGGCCCC 
GCCGTGCAGC GTCCCGTGGG 
GGTGCCCATG AGAGTAGGTG 
TCGTAGAaTC AGTGGGGACA 
GAAGTTGTTC GATAGGGGCC 
CGATTCCTCA GGGATTGGCG 
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GACCATTCCA CAACACGGAG CTCACTCCCT TTAAATGCGC 4200 

GCGGACCAAC ATCCATACCC ATCAAGTGCA TAGGAATATG 42 60 

GcTGCGCATC CGCACAGAAC GTGGaCGcAC ATaCGTGaTC 432 0 

CACCACCGcT TTGaGCCTTT TGCAACAGCA TaCGTGcaGT 4380 

CTAGGGAGGT ACCTACACCC ACACCTTGCG CTTTCAAAAA 4440 

CGATGATAAG CGCCGTCGAT GTTCGAAGCA GACTCTCCAA 4500 

CCTTGGCACC ACCGACAACC GCCACCATTG GCACCTTCGG 4560 

GGTACCTCAC TTCCCGCTCT ATCAAAAGAC CGGCCACTCT 4620 

GTACCACCGT AGATGCATGT TCACGGTGCG CAgTGCCGAA 4680 

CCCCATACTG GGCAAGCTCC CGCGCAAATT GCTCCTGCAC 47 40 

CCTCGGGGTG AAAGCGCACA TTCTCCAAAA GCACTACCGA 4800 

TAAATTCACG CTGCCCGACG CAGGAAGGCG CAAAATGCAC 4860 

CAAGGCAGTC CGCAACCGGC TTAAGCCGgT GTTTGCCGTT 4920 

AGGGACAACC ATCTTTCTTA GCGTTCCCCT CTGC TTTATC 4980 

CAAGATGGCT AATGaGCACT ACGTGTCGCG GCCCCTGCTC 504 0 

GAACTGCTGC AGTGACGCGC GTGTCGTCTT GCACCATACC 5100 

AATCAACACG CACGACAACA CGCTCACCTC GCATTGTGAC 5160 

TCATCTCCTC CTTTTTGACG CAGGGGTTAC CCCATCCGCC 522 0 

ACTATATTTC AAAAAAAGAT TCAATATCCG CATC GGTCGG 52 80 

CCTGCCTGTG CGCCCTACAC GCGTACGGGG TGGGGCACAG 534 0 

TTTTCTATGC GGAATGATAT ACCCCGGCGG GTGCTGATTC 5400 

AGCAAATGGC TGTTAACCAT TGGGTCCGTG GTTCGAGCCC 54 60 

TGGTTCTTTC AGTTAAGAAT TCTCATGGAA GGTGGTGTGT 552 0 

TTGGGGGCAG TGAGCAGTAC TTCCAGCTTT TTTAGAATGG 5580 

TCTGTGTGCG CCTGGTTTCT ATCGGAAAAA TGCGGGGCTT 5640 

CCTATGGAGT TGAAGGTCCG TCAGAGTGGC GGAATATGTG 57 00 

TGGATCTGTA TCATTCCTAC AAGCTTAAAG ACCTTGTGCT 57 60 

CGCGCTGTAT CGTCATTGAC CTTGAGGCGG TAGAGTATAT 5820 

TTCTCATCTA TCTGTGTTCG ACAGTGAAAA AGTTAAAAAT 5880 
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CCACTTCTTT ATCTCAGGTG TGCACGGCTC TGTAAAGAAA GTGATTGAGC TCACCCGGCT 5940 

GCTGAATTAT TTTCCCATCG CTGAAAGygT AGACGAGGCT CTTGCAAGGG CCCGATCCTC 6000 

TGCACCGCCG CAGACCGGCT CCCTGTAGGT TTTTCCTCGT CATGGGTTGA ACCCTCCCAC 6060 

GCGCGGGAGG GTTAGATATC CCACAGCTTT TTGCTGCCGc GTGCCGCTGC ACGGACTG C A 6120 

GTGGTAGCGT CTTTCCTTTA TACTCAGCAC TGTGCATATG GTAACGGACG GCGCTTCCCC 6180 

AAGAAGTGGG GTGTCGCTCA TTATCGGCAG ACCTTCCTCA GGTAAGTCAA CCTTTCTCAA 6240 

TGCCGTGTGC GGGTACAAGG TGTCCATAGT TTCCCCTATA CCTCAGACAA CCCGTAACAC 6300 

GGTGCGCGGC ATCGTAAATA TAGAATCCGA CCAAATTGTC TTTATGGACA CCCCGGGGTA 6360 

TCACCGGTCT GACAGAAAAT TTAATCTGCG CCTGCAGTCC CTTGTGCACA GTAATGTAAA 6420 

GGATGCTGAT GTGCTGTTGT ACCTAGTAGA CGCTACCCGT CAATTTGGAG AAGAAGAAGC 6480 

AGCCATCTGT GCATTGCTTG CCCCGTATCA AAAAACGCGC GTATTGCTTG CCTTCAATAA 6540 

AGTGGATGTC CTTCACAATT CGACCTCGTG CGACGAGCAT GCCTTTTTAC ACAGGCAAGG 6600 

CAGCGTGCTG CGGGCCGGCA GCCTGGGACG AgCGCTACAC GCCGCACTCC CCCACCTCCC 6 660 

TGCTGATCGG GTATTTACAA TATCTGCCCT GCACCAGGTT GGGCTCGATG CCCTCATGCG 6720 

CACGCTGAGA GATCTCTTGC CAGAAGCGGC GCCTCTG T AC CCTCAGGATT GCTATACGGA 67 80 

TCAGACCATC GCCTTTCGCG TCACTGAGCT CATCCGAGAA CAGGCAATCG CACGCTGCCG 6840 

GGACGAACTG CCGCACGCAC TATACGCCGG AGTGGAAGAC ATGGAGctGC GCCGCGGCAA 6900 

GCGGGAACTG TGGTGCCGTG CGTTTCTTGC AGTAGAACGG GAAAGTCAAA AGGCAGTGCT 6960 

CGTGGGGAAG AAAGGTGCAG TTATTCGCgc CATACGGCTA GATGCCATCC GCGCGcTACG 7020 

CACACTCCTC CCCTACCATA TTTCCCTTGA TATACGAGTG AAGGTAGACC GCAGCTGGAG 7080 

ACAACGCGAC CACACACTCA GCTCCCTTCT GTACTAGGAT GACCGGTGCC CAAATGAGGA 7140 

ATTCGCCGCA GGGGCGGGCC GCTCAAGGCG TATAGTTACT GAAGGTTCGT CACACACAGC 7200 

CGGAGgTCCA TAATACTGTA CCGCCCCCGG ATACACGTAG CTTGTTTTTA AGGCCCAACT 7260 

CGCACGCCGA CGAGAGAACA CCCGAAACGG CATGCCCTCC AGGTCAACCA GCGCCTTTTT 7320 

TATCACCGGC TTTTGACTTC CATGTCGGCG CTCCATGTTC ATGAGCATAG TAAGCGGCAC 7380 

GCCACCTACT GCCCACTCAG CTACAGAAGA CGTCAAGTTA CGCACCGACG CTACGTACCC 7440 

TGTAAACCTG TGCACGGCCA GAAGACACGC GGTCAAACCG AGCGTATAGC AATAATCTGC 7500 

ATCAAAGTTG GACGGAAAAG CGCATCGCCC TTCGTAACCA AAAAAATGAG CAATGCTGGA 7560 

AAAAACACCG GTGTACGTAC CTTCCTGCTT CATCTGCGCT AAGCGCTCCG TTACCTGGAG 7620 
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AATGAGCAAA CGCTCTGTGT CAATGCGCGA CACCTGCACA TTCCCATGTG GATCC CGATC 
TGCCAAAAGC TGTGTGGAAA TTTCAGCAGG TAATGCGTTA AACACCGCAC GAGCAGAAGC 
AG AC AACGC C TGCTCTATCC AAACGCGCTG CGGTtCAGGA GTGTCCAGCG CCTCAAATTC 
CTGCGCGCGG CGTGCCATCA CCTCGTTGAG CTCCGTAATT AGAGCCTTCA TTTC AGGT AT 
AAATTCGATA AGTCCTTCTG GAACTAACAC TATACCAAAG TGCTCACCGT GTTGTGCGCG 
CGTGGCGATG GTGTCACACA ACGACTGCAC GATCTGTGCG AGCGTTAACG ATTGCGCCGC 
TACTTC TTCC GAAATGAGAC AGACATTTGG CTGTGTTTTC AGCGCGCACT CAAGCGCAAT 
ATGACTGGCT GAACGCCCCA TGAGCTTAAT AAAATGCCAG TACTTGCGGG CACTGCACGC 
ATCGCGCGCA ATGTTCCCGA TAAGTTCACT GTATGTTTTT GTGGCAGTGT CAAAACCAAA 
CGAGGTTTCT ATCGCCTCAT TTTTCAAGTC TCCGTCAATA GTTTTGGGAA CACCGATAAC 
CTTGGTAGAA AT AC C AC TGT TTACGAATGT TCTGCC AAAA GGGCAGCGTT CGTGTTGGAG 
TCATCACCTC CTACAACTAC GAGTGCATCA AGCGCCATAC GCGTGACTGT CTGCGCCGCG 
GmGGcAAACT GGGACTC AC T TTCGATTTTG GTGCGTCCTG AACCAATGAG GTCAAAGCCA 
CCTGTGTTGC GGTAgcATtC TACACGGTCT GCGCATATCT CGATATGATC GCCAGAAAGC 
ACGCCCGCAG GACCGCCTAG AAAACCGATA AGGACAGAGT CAGCGTGCCA TCGTTTTAAT 
CCGTCGAAAA GCCCTGCTAT AACGTTGTGA CCACCTGGTG CCTGACCCCC TGAGAGTACT 
ATGGCAACAC GTAATCCTCG CGGCTCAGGT GCAGTCTCCA TGGGGGAATC TTCGTTTTTC 
TCACTAGCAT TAACGAAnTT CACCAGCGGC TGACCGTAGy CGGCGcAAAA AGAGAGCGCA 
ACGgTcATAG TCTGCCACCG CAgTGGTGGA TAAGCCGCGA CGCGCACAAA CGCGCCGAAA 
GTCCCCCCGA AGAAGATCGG GGACCTTTGG CAGGTAGCGA TGCCGTTCCT GTTGCAAGAG 
AGAAATACTC ATCGATGATT ACTCCTTCAT ATACGAAAAA TAGCACGACC GCACCGCCGC 
ACCCCCACAA CTCACTCTGC AGCAGGCGCG ACCGCGTGTG GATGCGCAaT ACTCAACGCA 
aGAaTAGCAC GTTTAAGAAC CGTCGCTTCT TCTTCATACA GTGAACGCCC CACACTGCAC 
AACGCAGCGC TCTCCTGTGC GATCACCTCC GCCGCCATTT TCCTGTCGTA TCCCATCTGT 
ACAAGAGCAG TTACCAGATC CTCAATTTCC CTCGCATGGG GAGCACACCC AAGATTGCTC 
GGATGTGCAG CACGATCATC TGTCTGACTC TGGGCACAAG AGGCCgCGTC GGTTAGCGCG 
AGCGTACCTT TCAGCGCTAA GAGCATGCGC TGTGCAGTCT TTTTTCCAAT GCCTGGTATG 
CGCTGGAGTG CACATAAATC TCCTGTATCA AGCGCTGCAC ACAAAGCCTG ACTGCTAATA 
CTCGAAAGAA CTTTGAGCGC CTGCTTTGGA CCAATACCTT CTACCTTTGT AAGACTGAGA 
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AAGAGCGTGC GCTCTTGTAC ATTCGAAAAA 
TACAACCAGG TAAAGACCTT AACGTGTGAT 
GCGGACACTG CAATTTCCCA TTCAATACCA 
TGGAGCGTCA AGATACCGCT GATGCTTTCG 
TGAAGCGAAC ACTACGCTAC CATGACAATT 
GTTCCCGAGC AGGCTGCTCT GCGTTGATAT 
CTACAGACAG GATATTATCC AGTGTTTGCG 
CCGTCTCCGG CAAGAGATCC AACGCACTGT 
CATCAGAAAA CCAAAAATCA TTGAAATCTC 
CGCGACGTAt GCAGCAACAA AAGCGGCAAC 
GGAGTGTTCC TGTGCAGGTT GCGTGCTACC 
AAAGTGATTC GTTACCACAA CAAAATCTTT 
ATGTGCCACC AAAGATTTAC GTGTGTTTTG 
AGGATTTTTT ACCATCTGTC TTCCCCCGCG 
TGCACTTCCT GTCTGGTCCT GCACCAGCTG 
CGAATATTTC CTCCCGGTTG TCCGCCATCG 
TTaTCCTGCG GATCGATATT CACCGCTTTA 
CGTACCAGTA AATCCAACGT GTGCTGTGCG 
TCGTCATCCT GTATCTCAAC AAGACAAATA 
TnTTCGCAAA GACGCGCGCA CGCGCTGAGT 
CATTATAACT CGCTATATTC AAAAATCGTG 
TAAAACCTGA GCGTCTCAAA GGGGGAAGCG 
ACGAATACCC CATGATCCCC ACCACCGTCC 
CGCTTTTGAA TACTTCAGGA AGGCTGTCAA 
GACGTATATG GGTTTGCTCA TACACGTATC 
TATCCCCCGG TAGGAGGTAA TACGTAGAGC 
CCATCTGAAC CCGCATCCCT TCCACACTTT 
CGAGGTCTGC AAGGTTGCTG ACAAACACCG 
CCGGTTCAGG CAATTCCCTG CCATGTGCTA 



PClfMi/13041 

469 

CCAAAGAGGC GAAGCGCATC TTCACGGTGA 9420 

CCAACCTCAC CGAACGCAGC AC TACTGTAT 9480 

TGCACCTCAA CACAGAGGCG CTCGCGCTCA 9540 

AACATTATCT CCTC TTTATG TGTGGCGCAC 9600 

GCACAAAGAC CGGATCGTGA TCAGAAACGC 9660 

GCAGTATATC AGCAGTCTCC GTGCGCGCTC 9720 
AGTAACCGCG GTACACATAG GTATATCGCT 97 80 

GCATCCCCAC TGCGGTGAAT TTTTGAATAA 9840 
CCGCCACCAC CACCGGAAGA TCTGCACGCT 9900 
CTGCGCCGCC TGCTGTATAC GCTTGCGTTT 99 60 

CCAAACGGGG TCATCCCCTC GCTTTGAAGA 10020 

CCCCTTATTC ACCCCTGATA CAAACTGAAA 10080 

AAAACTTTCT TGCCCTACTC CGATGCGCGC 10140 

CACCATTTGG GCAACCGAAT GAAATGTTCC 10200 

CACACGATCG GTAGGTACAA ATAACAACAG 10260 

GCATCCAACG ATTGCACGCC CGCaGGAGCa 10320 

TACCGAACGG CGCTGAACTC TGCCATtGCa 103 80 

CTCGTACAGT GATGATGTTT TTTTGCGCCA 10440 

ACGTCCGGCG CCTTAAG ATC ATTCACAAAG 10500 

CTGCTTTATT CCCTGCAGAA AAATTCTCCA 105 60 

CGTTGAACTG TATGG TCGAA ACTTCAGGAC 10620 

GCTCAGCAAG TTCTAATTGG TAACTAGAAG 10680 

CTTCAAAGGA ATCACCAGGC AGAGGAGGGG 10740 

ACATACGCCG GGGACAAAAG GCAAGGACAG 10800 

CTCCGTGCAT ATTCAAACGT GTAGAAGGGG 10860 

GATACGCAAC AGCAGGAACG GTGGGATTCA 1092 0 

CATAAAAATC AATAGTCTCT GCATCCGGTG 10980 

GCTGAGACAC CCGCGCATAC GAAATCAACA 11040 

GCACTCGCAC ATCCTGCGCG CGCTTGATAA 11100 
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CAAGCTGGGT GACGCTCAGA TCCCGAGCAT TGCCTTTTGA GATATACTCG CTGACAGTAC 
CGAGCACCGC CACGTAGTCA CCCACGCGCA AACTATcAGG GAAAGCCTTA CCACAATACA 
CAAAAATGCC GTCAGACGTT TTAGGATTGC CATCCCCATG CGGATCTTGA AAATAAAAAC 
CAATAGGTCG TTTACCCGAA CGCGCAATAG CAGTTACCAC GCCACGCACA TCACGCACGT 
GTTTACCCTC ATAGGCAGAA CGGTGTCCTT CCCCTTGGAT CGCACCGATT GAGTGGGGAA 
CAGACGCCGC ACTGCACCGT GATCCTGTTC CCACTATCCA AAAGATGACC CCCACGCACG 
CTCCCGCTAC TTTACTGCTC ATAGGACACT CCTCACGCGC AGTGTATCAG CGCAGGTAAT 
TTTGCACAGT ACGGTAGTTT TCTTTGGTGA CAACTTTATA GGGGATCCAC ACACACTGCT 
TTTCTGCCGT ACCGAATACT GCCGCGCGTC CTGCCACTCC GTTTCCTGCG TGCGTAAACG 
CAGTGCTGTC TTCAGCGGGG ATCGATGGGA CAGAATCACC CATAAGTGCA AACAAAAGAT 
TTAAAATAGC CTTCCCCTGA CTGGAGACAT CGTTAAGGAC GGTGCCGAGC ATCAGATCCT 
CTTCAATAGC TTTCAAAGCA GACGCAtAGC ATCGATACCC ACAACCGGGA CACGCTTATT 
TTCTTTAAAA AAACCTGcAC TCTGCAACGC TTCAATGGCG CCGAGCGCTG CGTCATCGTT 
ATTCGCAAAT ACTGCCCTCA ATGCGATCTC CGTGTGTGTG AATAAGCGTG TGCATCGcAG 
CyTGTCCTTT CACCcGACTG TCAaGCGCAA AaGCCTCCCc GATTATCTCG CCcTTTAATC 
CGATTTCTCT CAGCGCCTGA CACACATACC GCGCACAGCG AGCACCGCTT TTATGATCAG 
GATCCCCTTT GAGCAcTACG CATTGGATAA TACCGTCGGC GTTCTTATCT GCACTTGGTG 
TACGTTCCAG ATATTGCGCA ACCAGTCTGC TTTGCAGCAA ACCAAGCTCG TCGTCCTTGA 
CGC CTACGTA ATAGGCGCGT GCATACCGGT TCAAATCAGA AAGGTCAGGC ATACGATTGA 
AGAATACTAG CGGAATGCGC GCCTGCTGTG CCTTTTCAAT AACCGTGCGC GCAgcaCGAt 
GGTCTACAAG ATT T AC CGC A AGACCGTGCA CGCCGCGCGC AATAAATTGA TCGATGTGCT 
TGTTCTGAAT ACTCTGCGAT GCCTGACTAT CCACGATGAG GATTCGAGCA TGTTTTTTGC 
CAACCGTAGA GAGTATGTGA CGCAAGCGCG CCACGAGCGT GTTGTCATAC TGATACACGA 
CTACTCCGAT AGTCGGCTTT TCGCTGCGCT TGCACGCGCC CGCACCAAGT GCACACAAAA 
GGAGCGCTAC ACACATCCCT GTACCTTTCA TATTTCCTCC TCATGTTCAC CAGCGCATTC 
TGATTTGACA CTTCTTTCCC CTCACACCCT GATACCCGCG CGAGGAATAT AGAAATTAGA 
AAAAGGATGG ATTATCCAGT GCTGCCACCA ATCGCATGAA CGTGTCTATG TACCCGGCCT 
TGCGCCGTTT AGCGTAC AC g TCTGCGACAA TCGCCTCACT TGCCTCAATC ATGTATATGT 
TTTGGATTTC TATTACGTCG TACGTGCTAA AGCCATGCTC ATCTGCAAAA TTGCCAGGAA 



11160 

11220 

11280 

11340 

11400 

11460 

11520 

11580 

11640 

11700 

11760 

11820 

11880 

11940 

12000 

12060 

12120 

12180 

12240 

12300 

12360 

12420 

12480 

12540 

12600 

12660 

12720 

12780 

12840 



Printed from Mimosa 02/03/22 07:25:11 Page: 472 



WO 98/59034 




13041 



CCAGTGACTC CACACGGTAG GTGTTGCGCG TGCCGGTGCc gCCAAGCGCA TCCCAAAAAG 12900 

GGGCAAGAAG GCCCGGTACG ATAAGTCGTA CTTATACAGG ATCCGGGCAG GATGTTCCGG 12960 

CCGTACTCCT AACTGGCAGT ACCATTCATC CTGCTCATAG GCGGCCAGAT CATGCAGTTT 13020 

GCTCTTGTCC CGCAGTCGGT ACCCTGCAAG ACGGACGATA GTTTCGGGCA CGTATTTTTC 13080 

CAACTCTGCC TGTAAGTCAG CTACACAAGA AACGGATGCA TCATTGACCG TGGTAATAAT 13140 

TGCCCCTTCT GGTATTCCTG CGTACGAGAG AGGACTGCCA GGAATAACGT ACGAAGAAAG 13200 

CACACCGCCG ACGCCAGCAT TTTTCCACAC ACGGTGTGTT TCACCAAACG CACCAAGCCA 13260 

CGGATGCGTC ACCAATCCCC CGCGGTACAA GTTGGCAGCA CCTGCTTGAG CAATTCTACA 13320 

GGAAATGGCA 1333 0 
(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10214 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 

ACACGGTGGC GCGCAGATTA CGAAGAGGTT AAGCAGCTCG GTGGTTTGTA CGTCATTGGC 60 

ACAGAGCGGC ATGAAAGCAG GCGCATTGAT AACCAACTTC GGGGGCGTTC GGGGCGTCAA 120 

GGGGATCCAG GCCGCTCAAA ATTTTTTCTC TCTCTGGATG ATGATCTTAT GCGCATTTTT 180 

GGGGGGGAGC GGCTGAAGCG TTTTATGAGC CGTGTGGGTA TGGAACCAGG AGAACCTATC 240 

ACGCATTCCT GGTTGAATAA GAGTATTGAG CGCGCGCAGA CGAAGGTCGA AGCACGCAAC 300 

TTTGATGTCC GTAAGCACTT GCTGAATACG ATGATGTGCT CAACGAACAG CGCTCCTTCA 360 

TATACgCGCA GaGcACAAAT TTTGATAGAC GAGCATGTGG TAGAGCGCGT GTATACCACA 420 

ATCGAGGAGT ATCTTAACCG AGAAATAACC GCACTTCGGC AAGAATTGAA GCGGCGTGGG 480 

cGGCTTTCCC TCGGGGCGTT TCAACAAAAC CTGAGCACCC TGTTCGATTA CGCACTGGGA 540 

GGTGAGGACG CATCTGGCTG GAACGAAACG CGTCTTGGAA CGCTGAAGCA AGAAATCCTG 600 

GCGCATTTAA AAAAGAATAT TGAATCAAAG TATCTGCTTG CAGGGGCGCA GAACATGGAT 660 

ACGTTCATCC GCTACCAGTA TGTGCAGGCG ATCGATAAAA AATGGCTGGA CCATTTGGAA 720 

CTTCTTGAAA TCCTCCGGGA ATCGGTGTAC TTGCGTTCAT ATGGGCAAAA GAACCCGCTT 780 

ACCGAATACA AGCTTGAAGG GTTCGACCTA TTTTACACCA TGTTAGACGA CATTCGCCTT 840 
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TCGATCGCCT CGCAGGTTGT GCGCGTAACG GTTCACATGG AAGAGCAGCG CGTCCCGAGG 
CCACCACACt TGCACAGGCG GCACACGAAT TTCAAGCACT GGGGCAGCCT GGCAGAGGGC 
ACGGATCGCT ATCTGCTCTC CCGATTCAAG CCGGCGCAAA AGTGGGGCGC AACACCCCcT 
GCCCcTGTGG AAGTGGCAAA AAGTACAAAC ACTGTTGTGG CCGCTGAAGA GCAATCTCAT 
TATTTTGCTT GATGGGCAGG ACCATCCAGA TGTCTATCCT GTTCCAGGTA AAGACCGCCG 
CTCAGAACAG AATGATAAAT TCTTCAAGAA AGACATGGGT AACTCTCCCG AGACTCAGCT 
GTGTGTTGaG CGCATCGCcA AGGCAATTTT TAGCTGCTCG GGGCCAAGCA TATGCAATTC 
TTGCGCGGTG TGCTGTGCAC a aCGCAAGAG ACGCACGTCT CAGTTGTTCT TTTTTCTGAA 
CGAGCTCCTC ATACAGGGCG TGATCTGCGG CAGGATATTG CAAAAGTGGG GTAATGATGA 
CACTCACAGG CTGCTTATCC GCACTCAGCG CGCGCAAGTn CCAAGTTCTG CATAGACTGC 
ATGGGCAGAT TCCCGCTGGG CAATCCCCCG GGTTGTACGC CTAGTCTGCG TTTCCCTTGG 
AGC AC TGTGT GTCCTCACAC GGAACGCCCC GCAGTGCCGA GAAGTAACAC ACAGACGATG 
AGCGCTGCGA CAGTTCTCAA TGCACGGATA ACACGTTGTG CAGTCTCCTC AGTCATGGGG 
CATTGTAGCA CGCACAACAC TCACTGCACA GCGATAAAGA CTTgCTTGAC AGCACCCTTG 
TACCCTCGTA CACTGGGGGC GGGCATGGGT GTTCTTTCGT GAAGACAAGT CTGTTGCTTT 
CCGTTTGCGC mgsGCTGCGC TGTCCGGTTG TGCCACGGGT CAGAGTGATG CGGTCACAGA 
CCCGCTCTCG GTTCTGGAGG TTTCTCAGAC AGAGACGAGA GAGGCGCTGA TGCTATTTGT 
CTCTTACAAC GAGACGGGTG CATCTGTCAC C ATCTTT AC C CCTGAATTGG TTGCGCGTCT 
TTCCAAATCG TATCGCTTTC TTCGCGTCGA GGCTCCTCAC AGCGCATACA CCCTTTCCCC 
TGAGGCGCGC GAACGTAATC GCTTGTTGTT TTCGGAGTAT GAGGTTGATG GC CTTCCGTT 
CCTTGTTCTC CaAAGCGCAC AAGGGGACGC TTACTTTGCG CAGCGCATAC ATTCGACGCT 
GTCGAGCGAG CAGGAGCTGT GGGCGCTAAT ACGGTCTGCG GACGCTTCGA GAAAAAAAGT 
GCTGGCGGCG CGTGACCGTA TCGCTCAGAC CGAAGC TGCT GAAAAAGCAA TTGCCATCGA 
TGCATTTCTT AAGACGGTGC GTTACCCACG CTCTGCGCGG TACGACGCCC TCCGAAAAGA 
AGCACTCCAG GCTGATCACG AAAATGTCTC AGGTCTCCAC GGGGATTACA TGTTTCACCT 
GGCACGGCGG CGCGCAGAGA AATTTATCAA GCAAGAAAAC CTTGTAGCAG CGGGGAATGC 
TTACAAGGAT TTAGCGCAGT CACCGTTTCT GAGTGCATCT CAAAAACAGG AAGCGTGGTA 
CCTGACCGCA TACACCTATG C TCTTTC AG A AAAGGTATCT ACAGAGGACG TATCGCGTGC 
TTGCGAAAAG CTGTTGCAGC CCATCCGCAT GCTGCGCGGG TTGCACAGAT CAAGCAAACC 
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ATAAAGAAAC TAC TTACCGA GAGAGGcATA TGAACGAGGT CGATGC AGTA AAAAAGGGAT 
AGACAGTGcA AATCCTGCAG CAAGTTGCTC AGGGGATCGC GTCTCATTTC GGGCATGACT 
GTGAGGTGGC TGTGTACGGC GTCAGTAGCG ATGGTAAAAA CTGCGCGGTT GATTTTATCA 
CAAATGGACG CGTTACCAGT AGCAGGGTTG G AG AC AG AC C CCGCCTGTCG CTCTTCAAGA 
ATTACGGAAT AGAAACAGGc AAGGGCGGCT CAACTACCTC ATtCGCACGG AGAACTGCCG 
CTCCCTTAAG TCGAGCATGT TGTATATTCG TGACGAACAT AGCACGGCTC AGGCGATTCT 
AGCGATAAAC TTTGATATTA CTGCTTTGTA GGTTACGCAt TTGCGCTTGG CCGGCTCACC 
GGCACTGCTG CGGAGACCGC CTCGCATATC CACCTTAAGA GCGTCAGTGC GTTCCTCGAC 
GACCTGATAG AAGAGTCTGT AGAAAGAGTA GGAAAACCTG CAGCGCTCAT GAGTAAAAAG 
GAAAAAACGG ATGCCATCCA CTTTCTCAGC CAGATAGGGG CGTTTCTCAT TACACGCGCG 
GAAGACAGGG TCTCCCACTA CTTCGGCATT TCAAAGTACA CCCCTACAGT TATATCGAAA 
CTGGCAAATC GTGATCGCAC CGGACTGAGT CCCCAGCAGA GGGATCGCCG GGCCCTACTC 
CTTCCCTGGT TCAAGCTCCT CGGcGAAGAC AACTCCTCCG GAGCGGACCG CTCGCACCAC 
GCTCCCACCC GTCCTGAAAT ACTCGGACAC CACCGGTGAG GTCGCCGCGC GGAAAGATTC 
AACTATTCTT TGTCCCACTT CGTCCCCGTT CCGCGGGATC CGCCGCACCC AATATAACGC 
AGCGTTACGC GGGTCTTTGG TAAGCGCGTG CACCTCCCCA TCCACTAGCA GCCTTTTAGG 
CACCCCCTGC ACAAAATCTA CTGTCACGTG CCTACCTGTC ACCGGGTGCA ACCACTCAGT 
ACGCGCATGA CCGGTAGGTA GTTCTGTGTG CGAAATCGTC ACACGTCCAC GGCCATACCA 
TTTTTGCACC TTCTGTCCCT TCGCTCCATA CTCTTCTTGA TAGGCAAAAC TTCGATCCTT 
GTGCACGTCA ACTGACAACG CACGTACCGC CCCTTGAGCA TTGTAGTATT CCCGCGTTTC 
AAAAAATCCA TCATCATCCC GATCGCTATC CCGTtCGCGT TGCGCGTCCA TCCACATAGT 
GCGTACGCGC ACGAAAACGC GATCCAACAT GAGTTTCAGA AAATAAAGGC AGCCCTTCAT 
CAAGGTACGT CC t TACGCGG GCACGCTCAA ACAGGGAATC AGGCTTTTCG TAATAAAGGG 
AAGAAACTGT GATCTGCTGC TCAGTAGGGA GCGGCTCATT TGTCAATACC ATTGTGAAAA 
AATCGTGCGA CCGCACACCT TCTAAATCTC GCGCAAGATC TAAGGACTGC ATGCGTACCG 
GCTGCCACCG AAGTGCACGA GGACGCAATA CGTACGTTTT ATCCTCCCAC CCCACCTGGT 
GTACTTCAGG GTAGCGATCG TAACACACTC TGTAGCCTTG CTGCGTCAAA GGAACACGCG 
CCTGAGTTTC TCCCTCCACC CCATTATCTG CAGGAAGCGA CACGCGCGGG GCAATCCAGC 
GCTCAGCAGC ACGCTCATGG TCGTGCGCGG CAATATTCTG GGGTATGGGG AACACAGAAG 
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CGCGCGACAA TTCCGTCTTT TGCGTACTGT GCATCGTGTG CACGCACGTT GGCGCACCGT 43 80 

CATTCGCATA GACTTCATAC TCGAGGATGC CATCCTGGTT TGTATCGAAC TGTGCCCGGC 4440 

TCGGCCTTCC TGCTTTAAAA AACACACGCG CAGAAACAAT GCCGTCCCGA TTTTCATCGG 4500 

TATACAACAC ACCTTCGAAC TCCGCAAGAA ACCGCGCAAA ACGTGCGCGA ATCGGCCGAC 4560 

TTGCAAGCAA ACGACTGAAC TCATGCAGCA GATCCGCATA CAAAACCACC ACGCGCTGCA 4 620 

AGGGAGTGGA TGAAGACACA GACGACGCAT GCACACCGAA GCTTGAAACA TCATCGGTCG 4 680 

GGTCAACCGC AGGAACTGCC AGAGGGGAAT TCAGCGTACA GAACATCTCC ATCGCGCGTT 4740 

GTTCATCCAG CACTCCATAC TGCAATCCCA ACACAATTGA CTGGGCACGC GCGTACAGCT 4800 

CCTGC TCAGG AGC a GAATCC CCTGCGCGCG TGCTTTGCTC ATATGAGGAT TGAGCAGAAC 4860 

TCTCTCCAGG AGCATCTAAC GGACGCAAGG TAAAATACGT TTGCAGATAT CGAAATGCCA 4920 

TGTTCGTTCG AGGTTCAAAT AGAGCCGCCT CTACAAGCAG CGATGGGTCC TGCTCCTGCC 4980 

ATACAGAAAG ACGTGACAGG ATGGAATCTG CTATCTTTTT TGACCGCGAC GAGGGACGTC 5040 

GTGAACGCTC TTGCGCAAAA AACAACTTTG CAAAGCGCGC GTCAAGcgCC CAACGTTCCA 5100 

ACGCTTTTTC GATCAGCTCC TGCGCGTGCT CAACTTGTCC GAGCCCGTAA CGTGCCCGGG 5160 

CAgCAACCAA TCTGCATCTG CAGACACCTG CTCAGCCGTT GCAAGAAGTT CCAGTGCACG 5220 

GGCGTGCTGC AACG TATCC A CACACAGACG AGCATAAAAA AGCCGCACCT CTTCTATATC 5280 

GTACACACAC CATTGCATAT CCTTTGCCAC GGCACGGGCC ATCCACTGTA ACGCACGTGC 5340 

GCGCGGCTGC TGCAGCGCGT AAGAAGCCCG TGCAGCAATA AATAAAAAGT CTGCTATTTG 54 00 

CGGAgcaGAA GCGACTCCCT GCTCTGCCTG GGACAAAGCT TCCTGCCATC GCCCTTCCTG 54 60 

CAGATACCGA GCCGCAACAC CTGGATGATT ACGTTCTAAA TCCTGTGGAG GTGCAGGTTC 5520 

AGACACGCAC GAGGCATCTT GGATGCCAGA AAATGCACAA AAGATACTCA CTGCACAGAG 5580 

TCTTCCCATA CGTGGGCACA TTCCCTTACT CATGAGGAGT CCTCTCCGCG TAACGATTTT 5640 

GGTAACCAAG TGctGCGCCA TAGAGCGCAC GCAACACACC ATCCTGCTCT ATCATCGGTA 5700 

ACACCGTGCG GTCCGATAAA GGTACGTGCC ACTCTGAGAA CATCTTTCGG ATCCCCTTAT 5760 

GACCACCGCG GATGGAGATG GTGTCTCCCG TGCGATGGGT TCTGATATAA AAGGGAAAAG 5820 

AAAACGGACC TACACCCACG TGGTCCTGTG CGCAACAGAC AAACACGCC G GCAGGACGTA 5880 

CTTCCACAAG AArgTTCCGC ACGCACAGGG GTAGGCACCA GGACGCGCCA CGTAGATTGC 5940 

ACTCACTCCT TGCTTTTCAG AGGAAGGAGG TGATCCTGCA TCCTGTTTCT TTGTCTCACG 6000 

TGCTGTGTCC GACGCATGTA TGCAGGAAAA AAGCACATAT GCACCGGCAC GCTCTAACTG 6060 
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CAGCCCTGAA ACGTGTATCC GACGCACACC ATCAAAACGC GCACACCGTT CAAGCGCCCC 
GCGTGGCACC CGGTGCGAAA CTCCCAAACG AACGCAAGCC TCCTGCAAAA GAAAGAAGCG 
CAATATAAAT TCAGCGGCGA GAAAGTCCGA CCGAGGCATC CGCAGACGCG TGCCCAACGC 
ACGTGGTACT GGTTCCCACG CATGCGAACA ACCTTCACGC CACCGCGTCA AGGCAGCAAC 
ACAAAAGCTG TGTTCTGCAC TAATCCCCGC AAACGTTTTG TCTAAGCCAG AGCGCCACCC 
TGCAAGCACT GCATCAAGTG CAGGGATAAG TTCATGACGG ATACGGTTAC GCACATATTT 
CCTGCACGTA TTTGATGCGT CTTCGCGCCA ACGCACACCA CGCGTCTGCA AGAAATCTTC 
AACACACGTG CGGCTCACCT TTAGCAGCGG ACGCACGTAC CGTCCACGCG CAcTCGTATA 
CCTTGCAACG CGGAcGcGgC CGCTCCCTGG AATAAGCGCA TGAGCAGTGT TTCGTACTGA 
TCATCACGGG TGTGCGCGGT TAGAACCACC TGTGCTCCGC AGCGAGCAGC CACGTGGTCA 
AAGAC CTTAT AGCGCAGTGC ACGCGCCGcg TCCTGCACAC CGCGGCCACG AATTTTAGCA 
CACGCGTGCA CCGCACCGGc AGAAATCTGC TGCACGAAAC ACGGAAGGGG AGGAGAAAAA 
CGAGCACACA GCGCACGCAC AAAACGCGCA TCGAGCGCAC CTTC CTGAGC GCGCAGACTG 
TGATCAACCG TG AC CGCGCA CGCACACACC CCAAAGTCAG GAGCGAGCTC GTGCGCCGCA 
TAAAGAAGCG CAAsmGAnTC GGCACCTCCT GAAACCGCCA CGAGCAAGCA AGAAGGCTTT 
CTCGGCACAA GGAAATGCCC AAAGctACGC GCCACGTGGA CGAGCAGCGG GTGAAGCTTC 
TGCCTAGACT CACTCACCTA TAAAGACGGG CACGCTGCAC AGTGTGCCGC ACCGCGcgCG 
TTACACCGCG CACCATCTAG CCGGTCCTCG CGCCAGCGGG TGAACCCGCT TCGGAAGCAG 
AAGAACTGAG TGCCACAATC ACCGCATCAG GATCGCTGAG AACCACCACC GACGCGGGCA 
GAGGAACATC ACGCACACGG CGCACGTCGC CGGCCCCGAG CCCACTGATA TCAAGCACAA 
CACGGTCGGG CAAGTTGCGC GGCAAAGACT CTACCTCGAT ATATGAGAGC CCCTTTTCCA 
AGCGAGCCCC ATAGCGCACT CCTTCAGGAG AACCACACAA CTGCAGCCGG ATTCGCATTC 
GCAACGGAAC ACTCTCTTCA Ac TGCGTAGA AATCCACATG CTCCACACGG TCACTGACCA 
TGTTATGCTG ATAGTCCTTA ACAAAAACGC AAAAGACCTC GCCACCATCC AGTTCCAAAG 
ACAGAACAGT ACTCCTGGTT AAGGCACGAA ACAATCTATc GAAgnTTTGt GCGCAAGTTC 
aAGGGGAACG GACACGCCCC GATGGTCATA CATAACCGCA GrCAAACGCC CTTCCTTTCT 
GCCAgCACAG CGGCATACTT CCCCAACTGG ACGCGCCTTT TCCCCTTCAA ACGCCTTTCA 
TCCACAATCC AATCCTCCAT GCACAGAAAG CGAACACGCC GCAAACTGGG ACGGTAGGAT 
TCGAAC t ACG GAATGACGGT ACCAAAAACC GTTGGCTTAC CACTTGCCGA CGTCCCAAAG 



6120 
6180 
6240 
6300 
6360 
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6540 
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ATATCCTACC CACATAACCG GACACGCACA CACCAACACC ACCGCTTTGC AAGCAGCTTA 
TGCGCGCGCC GAAGCTCCTC CTCATCCCGA TACAGACCAA AAACC GCGCT CCCGCTTCCA 
CTCATCGCTG TAAAGCACGC ACCCGCACGG GCCAGATCCC AACGCGCAAG GGCGACTACA 
GGGTACCGAc GCTGTACAGG GGCATCTAAG CTATTAAAAA ACCGCCACCG CGCACAATCC 
TGTGCATAGT GCGCAGAAAG CGCGGTAGCC CCACGCAGAG AGTACTGCTC GCCGTCGGCA 
GCATGTACGC CGCACGCACG CAACCTGTcC AAATCCTcAT AGGcCTGTGC AGAACCGCTG 
TGCAATCCCG GcCAGACCAA AAGCCCCAGA TAGCCAGTCT TTGGAACAAG GGGAACGAGC 
TGCTcACCAC CACCTAGcAC gCACGCAGcC TGGGAAGCCA GGAAAAAAGG GACATCAcTG 
CCGACACTAT ACGCCAcTTC TCGTAGAAmC CGAGCAGAAa GGGTCGTCCC AAACAAgTAT 
CAaGGCCACA CAAAaGCGCG GCAGCATtCA GCAGACCCCC CACCAAGTCC AGACCtGCaG 
GGATACGCTT CACTACGCGC ACGCGCACAC CATCGTGAAC GCCAGTTACC TGACAAAACC 
GCGCATACGC ACGGGTCAGC GTGTTTTCTC GAGGCAGAGC CATATAAGGC GAAC AC AC CT 
CACACCGGCC AGGGATATCC AGGCGCGAAA GAGACAAAGA ATCCGCAAGC GTAATGCGCT 
GC ATT AC ACT CTCAATCGAG TGAAGACCAT CGGCCCGAgT GCACCAACCC ACAGATGCAT 
GTTCACCTTT GCGTGAGgCG CAAACTCAGC GACTGCACCC GCCATTCTAT GACAAGCGGA 
CACAGCGTGT CAATTCCCCC TTCTCTCTAC cTGCACCCAA AACACAAGAG AAAAAATACC 
TGTGC CTATT AGGCACAGTT GACAGCGTGT GCGCTCCCCT CTACGATCCA CCCCTAGCTT 
TCACCATACC ACAAGCAGAG GTCAGCCATA TGAACGAGAG AAACAAGTTA CTCGCACGCG 
CCCTGTATTC CTGCGTTCCA CACGTCCAAG GCTCGGACGA CTACGAGGAC GACTTTGAAG 
ACAGCGACTT CCAGGACGGG GATTTCGATG ATTTTGAAGA CGAGGATGGC TTTGACGATG 
ACGATGACTT TGAAGACGAC GATTTTGAAT ATGAAGATGA GGACAATGAC CTAGACTTTG 
ACGAATAGGA CGCACGCGCG GGTGTGGTTG TCGAGGCGAC ATGATCGCAT TCCTGTTGCC 
TGTGATGCGA GACTGCTAAG AAATCTTAAT AAAAAAGTTT TTGATAAAGC GTGC GCGTTC 
GTCTGCCTTT TTCCAGTATG GGCTGTGGGG GAAGCGTTCC AGTATTGTCT TGTATGCTTC 
GAGCGCGAGG CGTACGTTTC TCTGTGCGCC GTTGATCTCA TAGGCTTGTC CACGCAGGAA 
CCACGCTTCG TCCATTCGTT CGTGAGAAGG GAACTGCGCA AAGAAATCGC CGAGCGAAgn 
GgGGCATCTC GCGCGTTTCC CTGTGCACAA AACTGGCGCG CTTCTGCTAG GTGATCGCGT 
TTTTCTTGAC CCTCTTTGTG AGCAGATGCC GGGACATGCG CCTCAATAGG CGCAgCTGAG 
GGAGCAGAAG GTTGAGACGC TGGTGAAATC TTCCGCGGAG AGTACCGCTC CGACACACCC 
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TGTGCTACTG CATCCGACGG CACAGGGTCA CGTCCTCCTA CGGTGGGTTC CCGATCTTTT 9600 

TTTTC A t C AG GACGGGGCGT ACCGTGCTGA GCTTTCTCTG CAACAGCAGT ATCCGTCTGA 9660 

TCCTGCCGGA CAGGCGCTCC AGTATGGGCA GCGGCGCGCT GAGAACCAGA AGTTCCACTC 9720 

TCTTCTGCTC TGCGCTCGGT ACCCGTTCCC GCAGAGATAA CTCAGAAACG ACAGTATCAG 9780 

GAGGAGACGA CACCGTACGC CGGTAC TCAG GCGCACGCAC CACACGCGCG AGCCCTTCCC 9840 

GCTTCGGTAC CACCTTGACC GCAAGTGCGT CGGAGACAAA ATCACCCCGA AACACATCAA 9900 

AATAGGAGAA CGCTAAGACA AAATCACCCT CTCGCTCAGC ACTAAAGGTA AAAAGCGAAT 9960 

GCGAnCTCCT CCAACTTGCG CTGGTGATAG CGCAAACCAG GCTGC GCAGT ATGCTCGCCC 10020 

ACGTACACCC AACCTTCGcC CGGaTACAAA ACCTcAAGTT TTTGcCCCAC TGcAAGcTGT 10080 

aCCGcGCGCG AAACGgGGCT ACCTtCATCC TtCAGGgCGG TTCTTcAGGc ACCATCGCGc 10140 

GTGGAGAATC CTcTGcCGGC TCAGGCTCAG CCTGcAcCTC CGcCTCACGA GGAGGsTCTG 10200 

ATGCAGGGGG CGGA 10214 
(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 660 base pairs 

(B) TYPE: nucleic acid 

{ C ) STRANDEDNESS : doubl e 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 

CTAATGAAGG CGATGTTTTC TTCTGAAAAG GACCCGTGGT ACCT ACTCGG CGCGGGGGTT 60 

GCGTGCGGTT TGGGAATTGC CGCTTCGGcG CTTTCTCAAG GGCGGGCTGC CGCAGCCGGC 120 

GCCGATGCGC TTGCAGAAAC AGGTAAAGGA TTTAGCCAGT ATTTGACTAT CGTTGGTTTG 18 0 

TGTGAGACGG TGGCGCTTCT GGTGATGGTT TTTGGTATTA TCAACTGCTA GATGTGGTGA 240 

ACGTTGTGGT ATAGCGCTTC GACCATGCTT TTGATAGACG TAGGGAACTC GCACGTATTT 300 

TCGGAATCCA AGGCGAGAAT GGTGGCCGTG TGTGCGTGCG TGAGTTGTTT CGCCTTGCGC 360 

CTGACGCGCG TAAAACCCAA GATGAGTACT CGCTTCTCAT CCATGCGCTT TGCGAACGTG 42 0 

CGGGGGTCGG CCGTGCTTCT CTCCGTGATG CGTTTATTtC CTCCGTCGTG CCTGTGTTGA 480 

CAAAGACCAT TGCAGATGCG GTCGCTCAGA TTAGCGGcGT CCAGCCG t TG TCTTTGGCCC 540 

GTGGGCGTAm GArCACTTGC CGGTGCGCAT ACCAGAGCCA gTGCGCGCGG AAATTGGCAC 600 

TGACTTGGTA gCCAAmGCGg TGGCGGCCTA TGTGCAnTTy CGTTCTGCTT GCGTGGGTAT 660 
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(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 864 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 

ATTTCCACAT TACTCAATAA AAAGACCCAG GATTTTAAAA AAAAATACCG CTACACCGCG 60 

GATGTACTTC TTATAGATGA CATTCATTTT TTTGAAAACA AAGACGGATT ACAAGAAGAG 120 

CTTTTCTATA CGTTCAACGA ACTTTTCGAG AAAAAAAAAC AAATTATCTT TACCTGCGAC 180 

AGGCCTGTAC AAGAATTGAA AAATCTCTCT TCTCGCTTAc GCTCGAGGTG CTCCCGAGGG 240 

CTTAGCACTG ATCTGAATAT GCCATGTTTT GAAACGCGCT GTGCTATCTT GATTAAAAAA 300 

ATACAAAACT ATAACAGCAC CTATCCTCAC AAAGCCATCC ACATTTCAGA CGATGTTGTC 3 60 

CGACTTGTTT CTGAAAACAT TTCTTCAAAT ATCAGGGATC TTGAGGGGGC ATTAACAAAA 420 

ATTATCGCTT TCATTGAAGT GTCGGGATCC ATCACGATAG ATATCGTTCC CTCTCTCCTA 480 

AAAGAGTTCT TCCTCTCTGC AAGGCCAAAA CACATCACAG TAGAAACTAT TCTTCATGTA 540 

GTTGCAGATC ACTTTAACAT TTCGTATTCA GATCTAAAGG GTAAGAAACG CAATAAAAGC 600 

GTTGTTTATC CTCGGCAAAT CGCTATGTTT CTCTCAAAGG AACTGACAGA GCTCTCCACT 660 

ACTGAACTTG GTATCGAATT TGGTGGCAGA GATCATTCAA CCGTCATTTA CGGATGTCAA 720 

AAAATAGAAG GAGAAATTCT CACTAATCCT TCGTTACAGG CAAATCTTGA TTTGCTGAAA 780 

AGTAAAGTTC AAGATTCAAT CCG CTAGGGC GTAGACACTG AATTCGATGG GGATAAGTGG 84 0 

TGGATAAAaG AATATAAATT AGTCATTACA CTTTACTCAC GAATATCCCC CTTTTTTTAG 900 

AGAAAAAATA TACTTTCTTC ACAaGCTTGT GTGCGGTTTT TGTTTGGTAA TTCTCGAGAC 960 

ATAaGCACTT ATCCAGATAT TCACAGTTAC TATTATGTGA TACGACTACA TTCTTTATAC 1020 

TTATAAGATT AATAAGGAGG AAACTAACTG TGAAAATCCT ATGCGAGAAA GAAGCCTTTC 1080 

TGAAGGAAAT AAGCACAGCA CAAGAGGTTA TTTCAAATAA AAAAAACACG TCTATTTTTT 114 0 

CGAACGTCCT ATTAGCTGCT CAAGGAGCCC TGCTTACCAT CAGAGCAACC GACACAAAAG 1200 

TTACCTTTGA AACTAGCATT CCCGTCAATG TTCTCGCCGA AGaCaACGAC AGTTTTTTGC 1260 

GACAAACTTG TGAATGTTGT TTCTGCCCTT CCAACAAAAG AAATCGAATT AACGTTATGT 1320 

GAAGAACAAC TTGTCATTAC cCCTCCAAAC AAAAAGATAA GCTTTcAGCT CAGAACCCTC 1380 



Printed from Mimosa 02/03/22 07:25:18 Page: 480 



WO 98/59034 






479 






J/13041 


TCGCATGAGa 


GTTTTCCATG 


TTTCCCTCAA 


AATGAAGGAG 


GCGTCTCTCT 


TGCTGTGCCT 


1440 


ACCTCCGATC 


TTAGAAACAT 


GATTAACCAT 


AC CGTTTTTG 


CAGTTTCAGA 


AGACAGTACG 


1500 


CGCCATTTTA 


TCAATGGCGT 


ACACGTTGAT 


TTTCAGTATG 


GAAATATTAT 


TTGTGTTTCA 


1560 


ACAGATGGAA 


AGCGGCTCGC 


CTATATAGAA 


AAAAAGGGAG 


AATCCTCTCC 


CCAATCCTTT 


1620 


TCGGGTGTTA 


TTGTGCCAAC 


TAAGATCTTA 


GGCATAGTAA 


ACCGTAAGCT 


TACCCCTGAA 


1680 


GGATCAGTGA 


CGCTATGCAT 


TACGTCGCAG 


CACGTTTACT 


TTTTTTTCGG 


TGGATATAAG 


1740 


TTTTCTTCTG 


TGCTTATTGA 


GGGGCAATTT 


CCTAATTACA 


AAAGAGTAAT 


CC CTGATCAT 


1800 


CAGGAGCGTT 


CTTTTTGTGT 


TGGACGTGTG 


GAGCTAATGG 


AGGCACTTAA 


ACGAG TCTCG 


1860 


TTGTTGGTAG 


AACAAAAATC 


TCACAGGATA 


TTTATTACCA 


TACAGCAGGG 


TTTGTTGACT 


1920 


TTAAGCTCAA 


AAGC TC AC AC 


TCAAGAAAAT 


GAAATAGGTG 


ATGCTCAGGA 


AGAAATAGCC 


1980 


TGTGCTTATA 


CAGGAGAAAG 


TGAGGTCATA 


GCTCTTAACT 


ATCTATACCT 


TGAAGAACCG 


2040 


CTTAAGGTTT 


TTACTTCGAA 


GGAGGTTCAA 


GTGGAATTTA 


CCGATCCTGC 


AAAAGCACTC 


2100 


ACGCTTCGTG 


CTGTACCAAA 


CACGGACTGC 


TTTCACATCA 


TTATGCCTAT 


GCAAACGGAG 


2160 


TGATTCTTTG 


CC TTTTCTC A 


CAGTGACTGC 


AATAAATTTC 


AGAAATCTTG 


CACATCACAC 


2220 


GATTGATATA 


TCCTCTCCTG 


AGGTTTTTTT 


TGTGGGAAAT 


AACGGACAGG 


GAAAAACCAA 


2280 


TATACTTGAG 


GTTCTATATC 


TTGCTGCGTA 


CGGAAATTCG 


TTTCGAACAC 


GCACCGAAAG 


2340 


CGAACTGTAT 


GCAACTCACG 


CGCGTTCGAA 


TGAGTATCGG 


GTAAAAGTTA 


TGTACCGCGG 


2400 


GGAGTATACC 


CACACAGTGC 


AGATTTTCTC 


CAAAAATGGA 


AAAAAGCGCA 


TTGAGAAAAA 


2460 


CTTGAAAAAA 


ATAAGGACAA 


AAAAAGAACT 


TATCAGCAGT 


ATTCCCTGTA 


TTTTGTTTTT 


2520 


TCATAACGAT 


TTGGACTTCG 


TAGTTGGTAC 


GCCAGAACGC 


AGACGCTTCT 


TTTTGGATCA 


2580 


ATCCCTTTCG 


ATGTGTAATC 


CTCTGTATTT 


GGAATACTTG 


CAAAAATATC 


ACGCACTAAC 


2640 


AAAAACAAAG 


AACAGAGAGA 


TAAAAGAGAA 


ACGCGTTCAG 


TTACTCGATG 


CACTGGATAC 


2700 


GCAAATTGCA 


ACCGTGGGTT 


TTGATCTCGT 


GCAGTGGAGA 


ACTCAGCTTG 


TCCGTGACTT 


2760 


TAACGTGATT 


TTTACTAAGT 


ATTATGAGCG 


CCTTGGAGAC 


CTTGCGCAGG 


TGCGCATTGA 


2820 


GTATAAGCCT 


TCATGGTCTG 


ACTCCTCAGT 


TGAGGAGATC 


GTACATTCTC 


TTTACAAGAG 


2880 


ACGTAAGCAC 


GATCTTGCGA 


TGGGGATGAG 


TATGTCAGGT 


CCTCATAGAG 


ATAAGATTCA 


2940 


CTTTACTCGG 


TCGCAGGCGC 


TTTTCATTCC 


TCAGGCTTCT 


AC C GG AC AGA 


GGCGGTTGGT 


3000 


TTCGTTGGTA 


CTGAGGATGT 


CGCAGGCTGT 


GTTCTACACA 


GGaGTAACGG 


GAAAACTGCC 


3060 


CGTACTCTTA ATGGATGATG 


TCTTGTTAGA 


GCTTGATCCT 


GAGAAGCGGG 


AAAGGTTCAT 


3120 
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GATGAGTTTG CCTCCGTATG ATCAGCTGTT TTGTACATTT TTGCCAGGGG AAGCGTACAG 3180 

GCGATACGGG CGTGAAAAAA CGCGGGTATA TTTTGTTTCT GAAGGGGCGT GTCATGAATA 324 0 

ATGGTGTGAA TAAGCTATCG GAC TTACTCG TGTTGACCAC TGAATATATC CAAGCTTCCT 3300 

ATGAAACGGA GGCGTTTGAT GCGCATCGAG AATGGGTGTG TATTGTGGGT AACCCCGTTG 3360 

CGTTACACAG CACGCTGGTA GATATCAGAA ATGGGAAAGT TGTGGTCAAG GTGACTCATC 3420 

CTGGTTGGGC ACAATACCTT TTGTTAAAGA AAGACGAAAT TGTACATGCC CTTCGTAGGC 3480 

GATATCCGTC GTTGGGAGTG ACGGGTATGA GTACGTACGT AGATTCTACC TCACGTACCC 3540 

CTTCTGCGAA GAAGGACATG CAGGGACTTT CGGTATCAGA AAAGCAGACT CGTCCTGTGC 3600 

CTGAACTTGC CGAGGTATTT GAACAGCTCC GAACGC TTTT TCAGGTGAAA ACGGAAGAAC 3660 

CGTCACATTA GTTTTGCGGA TGGGATTCGA CGGATCTGTT CAAAGTCCAT AGGACTGCGG 3720 

TTTTTCTTGC GTGCAGCCTA TGCACGACTG TGTCTCTCCT TGAACGCAgT ATGGCTTTGC 3780 

GTTAGAATGC CCGCCCTATG GAAGAAATTA GCACCCCAGA GGGTGGCGTT CTTGTGCCCA 3840 

TTTCTATAGA GACAGAAGTC AAGCGTGCTT ACATAGACTA TTC TATGTC C GTCATAGTTT 3900 

CTCGTGCGCT TCCGGATGTC CGCGACGGTT TAAAGCCTGT TCACAGACGT ATTCTCTACG 3960 

CGATGGAGGA AAAAGGGcTA CGCTTTTCAG GACCTACACG GAAGTGTGCC AAGATAGTGG 4020 

GGGACGTTTT GGGAAGCTTT CATCCTCATG GGGATGCGTC CGTCTATGAC GCGCTAGTGC 4080 

GTCTTGGGCA AG ATTTTTC C CTTCGTTATC CAGTCATTCA TCCTCAAGGA AATTTCGGGA 414 0 

CTATCGGGGG CGACCtCCGG CAGCGTATCG GTACACCGAA GCGAAGATGG CGCGTATTGC 4200 

AGAATCTATG GTAGAGGACA TAAAAAAGGA AACGGTTTCC TTTGTTCCCA ATTTTGACG A 4260 

TTCTGACGTA GAGCCCACGG TTCTTCCTGG AAGGTTTCCT TTTCTTCTTG CGAATGGGTC 4320 

CAGTGGTATT GCAGTTGGTA TGACTACAAA CATGCCACCG CATAATTTGC GTGAGATAGC 43 80 

CGCAGCTATC TCTGCGTACA TCGAGAACCC AAATCTTTCG ATTCAGGAGT TATGCGATTG 4440 

TATCAATGGT CCTGACTTTC CCACGGGAGG CATTATCTTT GGAAAGAACG GGATTAGGCA 4500 

GTCTTACGAA ACAGGTCGAG GGAAAATTGT TGTCCGTGCT CGCTTTACCA TCGAGACGGA 4560 

TTCAAAGGGT AGGGATACCA TTATTTTTAC AGAAGTTCCG TATCAAGTTA ATACTACCAT 4620 

GCTTGTTATG CGTATTGGGG AACTTGCACG TGCGAAAGTG ATCGAAGGTA TTGCGAATGT 4680 

AAACGACGAG ACTTCCGATC GTACAGGsTA CGCATAGTGG TAGAGCTCAA AAAGGgTACC 4740 

CCCGCACAGG TAGTACTCAA TCACCTGTTT GCAAAGACTC CCCTGCAGTC CTCTTTTAAT 4800 

GTGATTAATC TTGCTTTGGT AGAGGGAAGA CCTCGAATGC TCACGCTCAA GGACCTAGTG 4860 
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CGCTACTTTG TAGAACACCG GGTCGATGTA GTGACTCGGC GTGCGCATTT TGAATTACGT 4920 

AAGGCTCAGG AGCGCATACA CTTGGTGCGT GCGCTGATAC GTGCCTTGGA TGCCATTGAT 4980 

AAAATCATCA CGCTTATCCG TCATTCGCAG AACACAGAGC TTGCAAAACA GCGTTTGCGT 5040 

GAACAATTTG ACTTTGACAA CGTGCAGGCG CAGGCGATCG TAGATATGCA GATGAAGCGC 5100 

TTGACAGGTT TGGAAGTCGA GAGTTTGCGT ACGGAATTGA AAGATTTGAC GGAGCTGATT 5160 

TCTTCTCTGG aGGAGTT AC T TACTTCTCCC CAAAAGGTCT TGGGAGTTGT TAAGAAAGAG 5220 

ACGCGTGATA TCGCAGATAT GTTTGGGGAT GATCGGCGTA CAGATATTGT GAGCAATGAA 5280 

ATAGAATATC TGGATGTAGA AGATTTTATC CAGAAAGAGG AAATGGTTAT TCTTATTTCC 5340 

CATCTTGGTT ACATTAAGCG CGTTCCAGTG TCTGCGTATA GAAATCAGAA TCGGGGAGGA 5400 

AAgGGCTCAA GTTCAGCGAA TCTGGCGGCT CACGATTTTA TTAGCCAGAT ATTTACTGCA 5460 

TCAACACATG ACTACGTGAT GTTTGTCACG AGCCGTGGGC GrGCCTATTG GCTAAAAGTA 5520 

TACGGGATTC CTGAATCTGG TCGGGCGAAT CGTGGTTCGC ATATTAAGTC GCTTCTCATG 5580 

GTAGCGACGG ACGAGGAGAT CACGGCCATC GTATCTTTGA GAGAGTTTAG TAATAAAAGT 5640 

TATGTTTTTA TGGCTACTGC GCGAGGTGTA GTTAAAAAGG TAACTACTGA TAATTTTGTG 5700 

AATGCGAAGA CGCGCGGTAT TATAGCGCTT AAGCTGAGCG GAGGTGACAC GCTGGTGAGC 57 60 

GCAtGTTGGT GCAGGACGAA GATGAAGTAA TGCTTATTAC GCGTCAGGGA AAAGCATTGC 5820 

GCATGTCGGG GAGGGAGGTG CGCGAGATGG GTCGCAATTC CAGTGGGGTG ATTGGGATAA 5880 

AATTGACGTC CGAGGACCTA GTGGCGGGGG TTTTGCG AG T AAGCGAACAA CGGAAAGTAC 5940 

TGATAATGAC GGAGAATGGA TATGGTAAGC GGGTCAGTTT TTCAGAATTT TCTGTACATG 6000 

GGCGAGGGAC TGCAGGACAG AAGATTTACA CACAAACGGA TAGAAAAGGT GCTATAATAG 6060 

GTGCTCTTGC TGTTCTCGAT ACAGATGAGT GTATGTGTAT TACTGGTCAG GGAAAAACGA 612 0 

TTCGCGTGGA CGTGTGTGCA ATCAGCGTGC TGGGGCGTGG TGCGCAGGGC GTGCGTGTGT 6180 

TGGATATCGA GCCATCGGAT TT AG TAG TAG GACTTAGTTG TGTAATGCAG GGGTAATGGG 6240 

CTCTGGGGTA TATTTCTCCG TGAGTGGCTG TGTATATGTT GTGAGTATTG TGGATAATGT 6300 

GCGTGCAGAA GTTGATGTTT CACGTGAAAC TgTsGGGATG AGGAGTGGGA TCAAATCTAC 63 60 

CCTAATTCTG GAGGATTATT TGGGTTCACG TTCATGTAAA CTTTATGGGG GTTGTGTATG 6420 

GGGACTCGTG TCAGATTTTC CTTCTGCGGT ATTGCAGGTG TATGTTTACT CGCACTAGGT 6480 

TTTTTAGTTA GTTGTTCTTT GCAATCTTCA CGAAGCGCTA CAAAGAAATC TGAGGCGCGG 6540 

AGGACTTCTT ATCGGATCGG TCTCATGACA AGTACGGGAT CTyAGTCTGT AGATGATGTC 6600 
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CTTGCGAAGA CACGCCTCGT CAGTATCTAC GGAGAGGCTC GTGGGGAAAC GGGTGGAAGG 
ATTGTCCATG TT AC TT AC TC CGATAACTTC TCCCACGACC ATGAAGCAAC CGTTTCTAAG 
TTGCTTGCAC TCGCTGAGGA TTCGACTATA AAGGC C ATTG TGGTTAGTCA GGCAGTTCCC 
GGCGTTTCAA AGGCGTTTGG GATCATTAAG TCTAAACGTC CTGATGTTTT GCTTTTTGCG 
GGAGAACCAC TTGAGCCGGT AGAGATGCTG CAGGAGTCTG CAGACATCGT GGTCAGTCAG 
GACTACTTGT TCGGTGGATA TGCCGTTCCG TGGGTTGCGG AAAGGATGGG GGCGCGCACA 
TtGGTGCATG TCTCTTTTCC CCGGCATATG TCCTACCCCG GTTTGAGGGT TAGGCGTACG 
GTGATGAGGG CAGCATGTAC CGATTTGGGA CTTTCCTTCG CACACGAGGA AgCGCCTGAt 
CCTGTAGAcG GTGTCAGTGA CGGAGAACTT GAGGATTTTT TCCACAAGAC GATTGTGAAG 
TGGATCAAAA AATATGGCAA GGAAACCCTG TTCTAcTGCA CCAATGACGC TCACAACAGG 
CCGCTCATCA GTGCCTTGTT GAAATATGGC GGTATGCTAA TTGGTGCAAC CATCTTCGAT 
TACGCTGATG CGCTCGGGGT GCATTATGCT GAGCTTGAAG ACGTGTATAA AATACGAGAG 
AAGGTTGAGA AGTCATTGGk TTCTTCGGCG CAGAGGGGCG CTTTGGATTA AATTTAAATG 
CACAGGCATT TAGGGTGACC ATGGGTTTTG TGGAGTATGC GCGCAAAATC ATAGATGGCG 
aACCGCGTAA AGATGATATG CGTGAAGCTC TTGCCGAATC CTTCGACTTG TTTACGCGTG 
ACGCACATTG GCGTATTGCT CCTTACCTAA GACTGAAAAC GCACGAAATT GTTCCGAATC 
ACGTGCTGGT GTATACGGAC AC ATACGTC C TGGGTAAATT TACCTTGCCC GTCACAGACC 
AAGTACTCCC AGAAGGGTAT TGGGCATTGA CCGCTAAGGA ATAAGAACTC CGTTC GGGTT 
TTCTGTTTGT AGCCGGGGAG ATGGATCGCT TTCTCTGTTT GGCAATGTCG CCGTCTCCCT 
GGGTCACCAA GTGATCTTGC ACCCTAGAAA GAGTGAACCG GTGTATCCAG GCCAGCTCCA 
GTTCTCTTCT ATCAACATGT AGGGATCCTG TGAAAGCAAC CCTTGCTCCC ACCGCACGGA 
AAACTCCACA GGTTTGATAG GACTTGCACG CAGCTCAACA GCGTATTGGA AACAAAGTTC 
TCCCTTTAAA TTGCGCGTTC CTTTGAACCC ATTGAAATTG AATCGGTTGG TTGC C AT AT A 
TATGTGCGCA CGTGGTTCTA TCCACATACT ATCGTAGCAC GGTATGCGGT AGCCTACCCA 
TGCATTCCCC ATTATCGGAA GGGCTATTGA AgCTGCGCCT GTTGCTATCG CGTCTGCCGG 
TAACCCTGCC GCGCGTGCTA CAAAGTTTGT GGCACTATTC AAGAAATTAA GAAGACCTAA 
AATACCTACT CTGGCAGCAT TGGCGACCTG CTGTGCTACC CCTTGGGCAG GTACGACGTC 
TGGGGGGAGA CCTCCGTTGT CAAGATAACT TTTGTAGCCC AGGGGAAGGT ATACACGTGC 
TTCTATGCCT GCGTTTAGGC CGTGCAAGGC ATGGGTGTAA TCATCTCCCG AACGAGTTTC 
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TAGTCTGAGA AACGCAGCAA AGTCCGTGTA TTGAAAAGTT GACTTTACAA AGGGACCACT 84 00 

CCCAAAAACA GACGCCGCCC CTGTTGCGCC GTATACACCT CCTGAAAGCC AACGcCACTg 84 60 

cGCTGTAACC AGCGCGTCTA TGCCTAATGA GTCCACGTGC TGTACCATCC AGCTGAATAT 8520 

ACGACGCACC GTCCGTAATG ACGGATACGT CTTCTCCGCG AGTTCTAGTG CCTGTTCGAC 8580 

GACACGTGCT CTCGcACTGT TCGTATCCCG GTAGGTATTT CCGGCATCCG nAGCTAAAAT 8640 

GAAGCGGA 8548 

(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6993 base pairs 
{B) TYPE: nucleic acid 
{ C ) STRANDEDNESS : double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 

CACCAnCGTC CCGnATCCAG TTCCACGCAC ATTGGCAACG GCGCACAAgC GCTCTATCTG 60 

ATCTTTGTGT ATATCAGAGA AAAcGCGAGC ACTGCAGAAA TATCTCCCGA ATTAATTTGC 120 

AATATATTGC ATACATGCCG GAATGGTACC TGGTACGAAA TGCACGGCGG CATACCACGC 180 

ACTTGTGACA ATTCGTAGAT CCTTTTATGC CGCATAAATT CGTGTTCGC T TTTTGCCGCA 240 

TGTATTCCCC ACGCAACACG CTCACTTTTA TCGTAATCCT CGTATATCTT AAGAACATCA 300 

AGATCAAAAC TAATGGAAAA CTCGGTATTT GGACGTGTGG AAACAAAAAG GTATCGCAAT 3 60 

ACTTCAGGCT GATATACTTC AAGCACATCA CGCAGCCCAA CCACTTTTCC CGCGGACGAA 420 

GACATCTTCC CAGGCAAACC TTTTAATCCA ATAAAATCAT AACGAAAAGA AACAGGCGCA 480 

GGCCAGTGAT AAATGTGATC AGAAATTAAA CGCGCAGTGT CAAAAGAACC TCCCTGAGAA 540 

TGATGATCCT TCCCTGCAGG CTCAAATACC ACATGCTCCT TACTCCACCG CATAGCCCAA 600 

TCAACGCGCC AG cTAAGTTT TACCGCAGAC GTCTGGCGTA AATCCACCTG CTCCCCATGC 660 

CCACACTCGC AATGATACTG AAGACACCAG TGGCTATCCC ACGCATCAAC CGTGGTGCAG 720 

TCTTTATGGC ACGC TGTACA AAACACCGAT ACGGGCCAAT ACGTTCCACT GATTTTATGC 780 

TGCTCATCTC GATATTCGTT TAAAATCGCT TGAATACGGT GCCGATTGTC GAGCGCAATC 840 

TTTATTTCCT GTGCGTATAC CCCCGCCTGG TATTGCTTTG ACTGATAAAC GTATTCAGGA 900 

TAAATACCTA CCTCCGGGAG CGCCGATTCA ATTTCC CGCT cATGGTGCCg CGCGTAc TAT 960 

CTTCCTGCTG AAAGGGATCA GGAACTGAAG TGATAGGCAT GCGAATATAC TGCTTCAATT 1020 
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CATCTTGAGC AGGTACATTG 
GTACAAAGCG CACTGATTTC 
AATCTCTCTG AAATTACCAA 
GTATTGATCA CAGTCAGCAC 
TGACTTTTCA CAGATACTCA 
AGCCAGTACG CACTTCCCAC 
ACACACAACA GCCATACGGT 
GCGATACAGG GAAAATAAGA 
ATTGCTAGAG GGTGCATCTC 
TCAGTATCCG GTCGAACCGC 
AAGAAAGAAT CCCTGTGGGA 
TATGTATGCT CCTTTGACGT 
CGAAGCGGTC CGGACTCACA 
aGCTTTTTTA TAaGGTCCcC 
ATCCAC TTTC C AC TCGTAC A 
ATAGCGAACA CTAC TATTTG 
CTTTGCAATC AGCCTCCGTT 
TTTCTCTTGC GCAGcAAATG 
GGGTACTCGT TGTGCATTGA 
CACGAGCTCA TCTGCACTAT 
TATCCCCTGC TGTCCAATAG 
ATTCACCAGC GCAAGGTTTT 
TAtACGTACA GTGTATTCTG 
GGCACGGCAC GACGCGTAgT 
ATTGCGTTTT TCTCGGGATT 
TGACcTTGCC AGTTGCAGCA 
TATTAGACTG CGATTGCGCA 
CGTACGTTAG GGCGAACAGA 
AAGACACCAA CACGAGAAAA 
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TCGGGAATCC TACGAAAAAC GTCATAATCG TCCCACGAAT 1080 

CCCTGGTCAC GCAGGcCCGC ACTACAAGGT CAACGGAAAT 1140 

TATGTACCGT TCCTGAGGGG GTAATCCCCG ATGCACAGGT 1200 

GTTCCTgATA ATCTgTGCGC AAC CTgTCAG CCCAATGAAG 1260 

TGATCCTTCC TTACAGGTAC GCAAAATATT TTTAAAGCCA 1320 

GCGCACATCC TACACTTCCA CATAGGTGGC AGTGCCGAAT 1380 

GAGTGCTCGC ATCTGCCTGG CAGACCGTAT GGTCACTCCC 1440 

ACTGCGTGCG TGTCAATACC GCGCGCACAG CATGAAACCT 1500 

CTTCTCAGAT TC CTATTC AT GCCGTATCAC TTGATGCGGC 1560 

CACCACATAA CGACCGCCTC TGTGTCCGTC GCCAGCACTG 1620 

GAAAAATGAA ACACATTGTA CACCAACGTA TCACTGCGCA 1680 

ATCGTGAATG AACGCAAGTC TAACAATAGC GCGGCATATC 1740 

CACAAAATAC CTTGtGCGCA CTCACCCCGA GCAATGAAAA 1800 

TTTCTTGGTT TTCTGTACCA GACACCTCGT GCGCGGGAAG 1860 

CCCCCGTTTC AAGAGAGATA AAATACACAC TGCTTTTCGC 1920 

CTCCCGTAGC TGGATCATGC ACTGCGGTAT AG TAATCTAC 1980 

CACTCACATC AGGCAGTACC CGCTCTACGA ATGCATACCC 2040 

ACGTCGGAAG CGGAGAAAAC GGCAACGCAC GTTGGTGTAT 2100 

ACCAATACAC TCTCATTGCA TCCACACTCC TACACACCAC 2160 

TTACATACAC ACCTTCAATA GCAGGAAAGG GTGTGCCCCC 2220 

CATGCATGAA ACGTCCTTCC TCATCGAACA GCAATATGGT 2280 

CTTCCGGATC ATGCTGCACG TGTTCTGGTA ACACGGCATC 234 0 

CGAATCTACG GCAAGAAACG TTGGCGCATG CAGCGGATAG 2400 

AATTGCTGCC GATTGCAACC CTTCAGAGAA CTGAGGAGTC 2460 

AAAAATAACT GCAAGCACAT CCCCAAACGA AtCATTCGCA 2520 

TGTGCCACGT AGAAAATACC GTCTTTCATA CACAGCTGTA 2580 

TACCCCGCGT CGGGGAAATG CAGTTGATTT TCAGCATCCC 2 640 

CGTTGCCCAT GCAATTCACG CCCCATCCAC CGTGTGCAGG 2700 

CTACCCAGCA AGAAAAAAGT AAAAAATCCC AACCGCAACG 27 60 
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GGTGACGCGT CACAACGCTA 
ACCAAAGCTT GCGGnAATGC 
AATTAACACG CGAGAAATTG 
CCCTTCTGCA TCCTTATAGC 
CGCAGAAGTC ACCTCTCCCC 
CCACTTGCGA TACGCAGGCG 
AATTTCCAAT GCAGTAAAAA 
AGAATCACTC GGTGTACAGT 
AAGACCGCGA GAAAACTGGC 
TGCGTATAAA CGCGGTAAAC 
CGTAATTTCT GCAGTTGCTA 
TCCACTCCCC CGCGGATTAA 
AGTGAGAAAT GGCACAAAAC 
CTGTTCAAGC AGCACCCCTT 
CCCCGCTTCA AAATCAACTA 
AAAACTAAAG CATGGAGGCA 
ACCATGGGGA GTGCACATAT 
ATCGGAAATC TGTACCAATA 
CGTTTCAACA CACGCCGAAC 
CTCATTACGC GCACGTTGTA 
TAAGTGCACA ATCGCGTCCA 
TGCATCCACG TTCTCTCTAA 
TTCTGAATGT ACAAGAAAAA 
TACCCATCGC CCCATCGTGT 
GCGGTTCACA TATTACCGCA 
GCACGCCTTC ACTCATATCC 
CACGTCGATA TGCAAGCCCA 
CTTCGAGAGA CTTACCTTGT 
ACTGGTGACG GCGCACGACA 
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CAGGACACGA GGATAGAGAT ACTCAAAACC ACAAAACGGC 282 0 

GCACGCGACC TTCCGCATCC TGCCCATTTT CTAGCAGCGC 2880 

CAAGCGCCGT CCCATTCAAC ATGTGTACAT AATGCTTCTT 2940 

GGACATTTAA GCGCCGCGCC TGATAGTCTG TGCAATTCGA 3 000 

ACGAACCACC CTGGCGTCCA GGCATCCACG CCTCCAAATC 3060 

CACCCAAATC TCCCGCACAC ACTTCCACCA CACGAAAAGG 3120 

TCTCTTCCTC AAGCGACCGC AGgCGTTCGT GCAGGCACTC 3180 

ACGCAAACAT TTCAAGTTTG GTAAATTGGT GCACGCGATA 3240 

CTGcAGCACC AGCCTCTTTa CGAAAACAAT GCGAGAGCCC 3300 

TCCGCTCTTC AAGAACCTCG CCTGcATGGT ATGCCCCCAG 33 60 

CTAAACAGCG GTGTTCTCCC TCAATACGAT AGATATTCGA 3420 

AACCCAAACC ACACACCATA CCCTCACGAG CAATGTcAGG 3480 

CGCGCTCTTG TAAAAACTGC AAACCAAACA TAATCAATGC 3 540 

CACGCTTCAG ATAATAAAAC TTTATCCCCG AGACCTTTTT 3 600 

TATCCAGCAA GCGCGCTAAT TCCACGTGAT CACGTGGcGA 3 660 

CCCCACAGCG CTTGATTTCG AGATTATCAC TGTCTGATCG 3720 

GCGTCATGTT TGGCAACGCT TGCG TTGC AG ACAAAAGCTG 3780 

GACGC TCGCT GTGAGCAATG CGATCTTTTA GTGCTCTGCC 3 840 

GCGCAAGcGC ATCCAAAGAG CTTTTCATCG TCTGTGCGTT 3900 

ATTCTTGCAA CTCTGCTAAA AGCTTTACGC GCTGATCATA 3960 

CATCTGCATG CACGTTCCTG ACCTTCACAT TTTCTTTTAC 4020 

TAAACCGATA ATCAAGCACG CGCCTTTCTC CCCTTACTTA 4080 

CGACACTCTC ATCGAaTGCT GCGCAGAAGC GCTAACAACA 4140 

ACGAATG TGT CAGACGTGGT AGCCGAGCTG TCCGGAAGGC 4200 

TCTCCCACGC TGATGTGATC GTAGTATCCT ACGCGCCTGA 42 60 

TCTGAAACCT TTGTCACGGT AAAATGCCCC AACACGTCCT 4320 

ACACCTTCTT TCAGCACTGA AAcTGCGCGT CTTTTTACCA 4380 

AACTCTGCAT CTTGTGTTCC AAGATCAATC ACCGCcTCCG 4440 

GTACCCATAA TTGGCAAACG ATCGTTGAGC ATCTGCATGA 4500 
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TCCTACGCAA 


CACACTCTGA 


TACCGATCAT 


TTCCCGAGCG ATACGCATCA AAGGTGTGTG 


4560 


CCCGAGATCC 


CGTCGATGCA 


ACATACAATT 


CCAAACGCAC 


GCGTAAATCC 


TGACCGTGCT 


4620 


CCTGCATTGT 


GATGAGAGCA 


AAATAATCAT 


CACCAGCCTC 


ACGCGCCGTG 


CGAAAAGCTT 


4680 


CTCTATACGA 


GTGAGAACGC 


GCGCTGTACC 


CAGTTACTTT 


TTAaCTGCGG 


TTATAGGCAA 


4740 


ACGAATCTTG 


CACTGCGTCA 


GAAAGAAACG 


CTCAGCCTCA 


GGATGCAATG 


CATTCGCAGG 


4800 


ATCAGGATGG 


TAAAAAAGAG 


ATATTGAAAG 


ATGCGCCTTA 


TC C AG AT ACA 


AGGCATCAAC 


4860 


ACGCCACCTA 


TTTTTAATTG 


AACGTGCATG 


CGTCTTCTCG 


TATGCCTCTA 


CTGCATCATT 


4920 


GATGCGTGCA 


CTGCTCTTTC 


CAATAGACTG 


TAAAAACTTT 


AATTGCTCAA 


GGGAGCGCTC 


4980 


AGGATATCCA 


AAACGCAGGA 


GCAAACGTGC 


GTAAgcTTcA 


CGAGCAGCAC 


CGTCGTACGG 


5040 


GTACACCTTT 


AGTGCGCGCC 


GATACTCATC 


CAGAGCCTGC 


CGACTCATAT 


TCCGACGCGC 


5100 


GAAACCGTCT 


GCCTTTTQCG 


TGTGAAAACG 


CGCAAGTTGC 


ATGCGATACT 


CATCTTCGTA 


^1 fin 


TTCAAGGTGA 


ACAATCGCGA 


TCTCTTCTAG 


CAAGATGCGC 


ATTAGATCGT 


CACGTGGATC 




TACTGTCAAC 


CCAACTTTTG 


CAGTTGCAAG 


CGCCTCAGTA 


TGTTTACCCA 


ACTTCAGAAG 


coon 


GG AC AG TG TC 


TTTACATACC 


AGGCATCCAC 


TTGCGTTCGA 


TCCGCCCTTA 


TGCG TTGATC 




ACACTGAGCC 


ACCGCGCGCT 


CATACGCGCC 


GCGCGCATAT 


AAAACTGCTG 


AAAGAAGCGC 


JfiUU 


ACGGGCACGT 


GGATAAGCCG 


AC TT AATGTG 


GAGCGCCCGC 


TCCAAATAAC 


GCTCTGCATC 




TTCATAGTGT 


GCCCGAAGCG 


TTGCAAGATA 


CGCGGCAAAA 


AAATGCACCT 


GTGCATTATC 




ACCGTGATAT 


TGCAACGCAC 


GTTCAACGTA 


CGTGAGCGCA 


CGCGGATAAT 


GACCAGCCTC 


ccon 

D Dow 


GTACGAGATA 


AGCGCAAGcg 


ACAACAACGC 


CTTGCGATTC 


TCTGCCTGAC 


GCTCCAGCGC 


5640 


TGCTTGGTAT 


AACAGACGCG 


CAGAGCTCAG 


CCGTCCCTTT 


GACACCTCAA 


TCTCTGCCAA 




AC C AAAGCGA 


GCATCTACGT 


CATTCGGATA 


GCGCGCAAGA 


ATTTCCTCAA 


AAAGACTACG 




CGCCTGATCC 


AACTCACCTT 


GACCAACTAA 


ACTGAACGCG 


CACAGCTTTT 


CAAGGGAAAG 




ATCCTGCGCC 


ATGAGTTTTT 


GCGCTTTGCr 


CACATGGTGC 


AACGCCTGAT 


CATATTCACC 


j D O \J 


AAGTGCGTAG 


AAACACTCGG 


CAAGACCACG 


ATATGCAAGG 


TTGTAAGAAG 


CATTTTTTTT 




TAATGCTTCT 


TGGTAGAATT 


CGATAGCAGC 


ATGCCAATCC 


(Tv^r>fiv^/^ 7\ f~< t\ rn 
TC CJ I vj CAL A 1 


Otitic 1 1 1L1 


finnn 


TCCTGCTTCG 


TAAAGCTGCA 


CGCCCGTCTG 


AGCAAACACT 


ATGCTGCAAA 


GCGCACCGTA 


DUDU 


ATACACGCAC 


AGCAGGCCTT 


TCATGCTTTT 


TCCCTTTCCT 


CTATGGCCGC 


TCAGATACAC 


6120 


ATGCAGGCTC 


GGAATCACCC 


CGCAGaCAGA 


TACAATCTTT 


ATAGTATTTA 


TCTTATGTGC 


6180 


TTCCATGTCT 


TGGATAATAA 


AATCGAAGGC 


ACCCCACGAA 


ACCTTCTCGT 


ACTTTACGGG 


6240 
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AATTTTTCCA AAAAGATTAA ATACAAATCC ACCTAACGTT CCAAACTCTT GAGAAGGAAA 63 00 

AACAGTATGC AAACACTCAG ACAAATCTTC CAAATCCACA CGCGCATCGC ACAACCACAC 63 60 

GCCCTGTCCG AGCGGTTCGA TATCCTCCCG CTCGTGGTCA AACTCATCCT GGATATCCCC 6420 

AACAATCTCT TCAATAATGT CTTCCATGCA CGCAATACCC GAAACGCCGC CGTACTCGTC 6480 

CACCGCGATC GCAATGTGCA CGTGCCTGCG CTTAAACTCT CGCAGAAGAC TGTCAATTCG 6540 

TTTGGACTCG GGGACAAAGA AgGsTTACGC AGCAGTCTTT CTAACCGCAC CTCCTGTGGC 6600 

CTTCCAAACA GCTTTATTAA ATCTTTGACG TACAGCACAC CCACCACATT ATCAATAGTT 6660 

TGTTCGTAGA CAGGAAAGCG TGAGTGTCCA CTCTCGGTTA CCTTTTCAAC GAGTGTTtCA 6720 

CCGCTCATAG AAAGCTCAAG AAAATCCACG TCAATACGCG GTATCATCAC CTCGCGCACC 6780 

GAAGTGTCAG AAAGATCCAC TATAmCGCGG rTCATAtCCT GcTTTTCTTC ATTCAGCGGT 6840 

TGCTGAAAAA TATGGGTAAC AGCGTGCCTG CGCCTCAACC AGTCTATGAC TCCCATGGTA 6900 

TACCCGATGA TAGCACCCGA CACGTGTGCG CCAGTATGCG CTCCTGCAAA CGCAACATCT 6960 

CTTGTCCAGG GnTCCTnCGA TCAGACTCTA TAA 6993 
(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5460 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 

TCGCGnnAGT CAAAAACGGC AACACTGAGT TTTTGCTCAT TGGGGGC AGC CAGGGGTACA 60 

AGGAAATAAA ACTGGAAACG GGGAGCGGCA GCGGTACCGG CTGCC TGAAG GCAGAGAACG 120 

TGCGCGGTCC GGAACAGTGG GGTGAAGACA GTGTCACTCC CAAGGATAGG GTAAGCCAAT 180 

ATGAAGGCAC CATCGGCCGT TTCGCAATCA GCGACATTTA CACCGTTGAG TCCACGAGTG 240 

GAGCTGGTGG CACCAACGGC GGCACTAATA AGCCGGACGT GTAtGTGGTG GTGGGGGATT 300 

CACAAGACGG GTATACGGGC CTGTGGAGAT TTGACGCCCA GAAAAAGGAG TGGAATCGGG 360 

AGTAGCCCGG GCGGATGCGT GCTGCAGGGA GGCGCGGGGC GGGAGGCCGC GCGCCGGTCA 420 

TCTTTACGCT TTGATAAAAA ACAGTTCGTG AATGGCGCGC CCCTGCGTCT GCGCCTTGCG 480 

TTCAAATTCC GTGGCGGGGC GCCAGGGGCg cGCACCCTGC GGTGCCCACG TGAGCGAGGG 540 

CGTGCGCGCA AgcTCTTCCT GCGCGCGCCG TGCGTACTCG GCCCAGTCGG TGACCGCGTA 600 
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TAGGTAGCCA CCCGGTGCAA GGGCGCGCGC GAGTAGGTCT GTGCGCGGGC GATACAGCAG 660 

GCGCC GCTTG TGGTGCCGCG TTTTTGGCCA CGGGTCTGGA AAGAAAATGT GCAGGCCTGC 720 

AAGTGTCTGC GGTGCGATCA TGGTGCGCAg c aCGTc GAGT GCATCGTGCT CGATGATGCG 780 

CAGGTTGTGT AAACGTTCGG CTTCAATTTT TCTCAGCAGT CGTCCGATTC CTGCGCGGTA 840 

CACCTCGATG CCGAGGTAGG AAAGGTGCGG GTTGCGTGCC GCGATTGCCG CAGTTGCGCT 900 

CCCCATACCA AAGCCAATTT CTACTACCAG CGGTGCAGGC GCGCACGCCG GAGCGACTGC 960 

GTCCGTTTTC CCCTGCGGAc GGG a AAAG C A CCGGCAGGCG CAGAAGGTGC CGCCGGTGAA 1020 

CAGAATACGG CAGCGTAGTC GAACACCGTG TTCTGATACG GGATnATCCA GCGGGACGCA 1080 

AGGTGCTGGT AGTCGCGTTT TTGGCATGCG GTCATGCGGT TTGATCTGCG CGTAAAGGTG 1140 

AGAACTTTCC GCATGCGTGC ACTGTCGTTT GTCATGGTGG CGCTTGCTCA GACAGGGCGT 12 00 

CTTCAGGATA TAAACGGTGA GGTTGTGAAA TAAAGCGCCA GGAGCGCTGA AAGTCCTCAA 12 60 

CCACGCACTG CAGGTAATAT GCGTCCTGGT GTGTCGTTTT CAGTGCGTCC ATGGACGCCT 13 20 

CGAAGGTGTG TACTCGGGGA GAGGAAAGGC GCGCATTGGG CAAAGAAGTT AACGAAAGGG 1380 

GATGCTGTGC CACAGTGCGG TAGTCGCGCG TCACAGCACC AGGGAAGTGT GCGTGTGCGC 1440 

AGTAAAGGGA CCGACCAGTG CCGGCTGCGC AGGCGCCGAT AGCGTCCAGC GTTGCGCGTC 1500 

TACAGAAAAG GAAAACGGAA AGGGTTCGTG TGCTGGGTAG CCTGCGCCTA GGGTAACGCG 15 60 

CTGCTGTGCA TATGTGCCTG TGCTGTCGCG CACGTACACC ACGTACGTTC CGGCAGGAAA 1620 

ATCTCCCCAC GGATACTGCA TCCCCCCTAC GCGCAnATtA CGGCGTCCAG GATGGAGTTC 1680 

GTCAGTGGCA ATGTACACGC GCGTGTCTTT TTCCCCAAAG ACCCATCGCA TTCCGGTGCG 1740 

CAnTCCTcTA CTTCAAGGTA AAAGTACTCC TCAGGAGGAG CAGGGTACTC AAGCGTCACG 1800 

TGCAGTGCCa GCGTTGCGTC TGTCTTTCCT GCTTTTTCGA AATACACTCG TTTTAGGGTT 1860 

AGCCCCTCTG TGCGTGCATA AAAGGGCATA CAGGAAGAAA GTACTGCGCC CGCTACACAC 192 0 

GGCACCACAC AAGAAACCAG TACAAACGCG CACTGGCGAG CGAGAACCAT GCGATTAAAA 1980 

CTCAAAAGAC AAATCCACGG TGGTGAACTC AGAGCGTGTA AAACGGTTAA GACTTGACTC 2040 

AAAGCGCACG TTTGCCTCTT TGwCnCTTCG CTCAAATACG AAATGTAGTA CACGCCCAAG 2100 

TTGGCGTGCT TGGCGGTATC TATAGTTTGC TGGCAGTAGT CAACAAGCGT GTCAGTGTCC 2160 

GCATTTACCA CTTTTGCATC CTGTAAGCAA TTGCGCATTT CTTGTGGGTA ATCATTCCTC 2220 

CCATACACGG TAATTTTTAG CTTACGGCGG GTAC sGGCAG TAGACACGCT CCTCCCCTGA 2280 

AGTTTCCACG AAATAAAGAG AGACTCAGCG TTGTGTTTGA GTTCTGCAAC AATACGCCTG 2340 
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CGGATACCTT CGTCGCAGAG CGCCTCTTGC GCGTCTTCAA GGTCTGGAAA GAGCTTTTGC 
ACCTCTTTTT TTAAATTCGT ATTCTCCTCA TTGATGAGGA GCTCTACAAG GACTAATGCA 
ATATACTCAG CAATGCGCGC CTTATTCAAA TATTGCGCAA GGCTGGTGTG AATGCTACTG 
AGCACAGACC CCATGTCCGG TGCGTGCTGA AACTTACTGA TGATAAACCA CGTAAAGGGG 
CGCAGGGTAT CCAAAAACTT TTCGCCGAGG AAAAGAAGCG TATTTTTCTC CTTTGGAGTA 
CGTTCTTCGC ACGCAGCGAG CGAAGCGTGA AGCAAACGAA GAATATGATG ■ CTTAATCTGC 
GTGACGTGTG CTTCATGCTG TTTTAAATGA TT AT AG ATG A AAGGAGCATT GAGATTTGCG 
TGTTCGTCAA TGCTGCTCCC CGGGTTACTG CG ATTCC AC T TTTTAATTAC TTCAGAGTTC 
AGAATCTGTC TGAACACGTA GCAATCGTAT TG AC GGT AG A GGATGGAATG CACGATGAGC 
TTTGAAAGAT CAATAATTTC TTGACGTGAG GAAGCAAACT CAGGCGGTGA AACCTCAATG 
AGGGAAACGT ATCCAGAGAG CAAAAGACCT TGCACCGTTT TAGGTGCAAA GGCATCGAGT 
GCG ATAC CAT AATCCTCGAC ATGCTCAGCG AGTTTTAATT TCATGAGCTT CCTGTTTTGT 
TTTATAAAAA ACTCGCTGCC CTCTTGCGTG AGTACGAGCT TGAGTGGGAG GTTCAGGATG 
CGTCGTTTAT GTTTTG AACT C ATAC TTTTC TGCGCTCCCG TTGAAGTGTG TCCGCCGTCC 
TTTCATTCTA TCAAGGAATG AGGGGTGGGG GATAAAGAGA TTCTACGTAG CAGACGAACC 
GCACCGATCC TTCCTGTGCA TGCAGGAAGG AGTCAGGAGG GGCGGTGGGG AATCAGAAAT 
AACCCAGAGA AAGGCTGATG TTCTGCGCAA AAGCATCCGG AGTGCTGACC GCTACAAAGA 
GGATAGGAGC TGCCGCACGT GAGGCGCCTG TGCATAATCG GGATTGCGAT CCAAAAGCTG 
CGTTAGTACC TCTTGTGCAT ACTCTTTTTC GTTCACTTTG ATTGC TAG TT TCCCAGCCTC 
TAGGTACACG TCCCACGCGC GTGCGTCCTG AGCGAGCACG GCACGGTACG CGCGGAGTGC 
TTCATGCGGC TGTCCAGCCG CTnACATACA CGCGTGCCAC TTGGCACTGT GCCTCGCGGT 
CTTGAGGCTG TGCGGTGGCG GCGC GGCGGT AGTGCGC TAT TGCCGTCTCC CACTGCGCGC 
GCAGGACATA CAACTTCCCT AGATTGCTGT TTACCTCAAA GTTTTGCGCA TCGTGGGCAA 
GAGCCGCCTG CAGGTGTGTT TCCGCCTCTT GCAATGCTCC CTTGTCCAAA TACAGTTTGC 
CCAAATTGTT GTGAGCCTTC ACGTGTGCAG GGTCCCGTGC TGCAGCCAGC TGATACTGTG 
TTAAGGCAAG ATCCACACGC CCTGTTTTTT kCGCTGCAAc aTnAcGCGTA GAGGAAGCGT 
GCCTGCGATC CATTCGCGTA GACGGCCtGC TGCGCGTGCT TTAACGCTTC TTCGTTCCTA 
TCGAGATCAA GCAGCACGGT TGCGAGGTTG TACAAAGTAA GCGCGTCCTC AGAATCCCGC 
TCTAGGATTT CTTGAAACAG ACGCACAGCC TCTTCCTTCG CACCACTTTT TGCCAGGACA 



'13041 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 
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CGTGyGCGTT CACGCGnGGC TGTCAGGTAT AGATCATGCG cTTCCGCAGC AGCCGCGCAT 4140 

TCTTGCGCCT TTTGATAGTC GGCAAGCGCA gCAGACACTT CTCCTTGTGC ATCGTGCACA 42 00 

CGTCCAAGCT CTATcCACGC ACGCGTGTGC GTTGGGTTCA ATCGGATGAC CGCCCTAAAC 42 60 

GCCTGCAGGG CTAGATCGTG TTTTGCACGT ACCCTACAGG TGAGCCCCAG ATTAAAAAAT 4320 

GCAGGCTCGA ACTTTGGATT CGCAACCGTC GCCGCATTGA ACGCTTCCTG CGCCTCAGCA 4380 

AACCTGCGCA CTGCGAAAAG ACGCTTACCC AACTCATAGC TGTAGCGATA TTCGCGCCGG 4440 

TCAAGCGCCG CGGCGcGTTC AAGCAACGTG AGAGCCGTTG TTTGGTCGTC ATCAGGctGC 4500 

GCATCTGCAA TGCACGCGGC AAGGTAGTGT GCAGCTGCGG AgCGTGGGTT GAGCCTCAAG 4560 

GCTTCCTTCA CATACACGGT TGCCGTTTCA AGTGCCCGTG TCCGCTCAAA ACCGTCACGG 4 620 

TTATCGTGCT GTGAAAGCGC GTACATAGCT TCTCCCATAC GGGTGTATGC GTCTGCTGCG 4 680 

AACAGCGCGT CCCCGGCAGG GAGTGCACGT ATTGCTTTGT TAAACACACG CACCGCTCCT 4740 

GGATAATCAC GTCG TTCCGT CAGTTCTTTT CCTTCGGAAA GCAACGCGTg CACGTGGTGC 4800 

TGTGGCGTGG CAGTTTCTGC AGGTGCGCGC ACCTCCGGGC GCGTTGCAAT CTGCACGGCC 48 60 

CGCTTTATCG ATTTTTCAGG AGAAAAAGAC GCCGTGCGAG ACACTCCCCG GGGCGGGGTG 492 0 

AGAACACGCG CGCCCTTTTG CCGCGTGTCT TGTTCTTGCA TACGCTCTCT CGGCGTAGGC 4980 

GCGTAAGGAA CTGGGCGCTG GGG AGC TAAC TC CTCTG AAA GGGTCTGTAG GAACTGTTCT 5040 

TCATCACTAT TCCCTGAGAG CTGGACGTGA TGGTCCACAA GCACGCCGGG TTCCTCCCCT 5100 

TGCTCTTGCA GCAGCTGGTT CGTCTCCAGC CAGGCGAGTT CTTGCTCAGA AACGCCCTCC 5160 

TCCTCAAGGA GAGTCTCGCC CGCTGCACGA GGTAGGCGTG CGCGCACCAC CCCCCGGGAA 5220 

AAGAGACTGA ACCCGGCAAC TACCGCAAGA AGCAGCACCA GCCCCGCGGC GAGTGCAATG 52 80 

AACGTCTTGT GCACATTATT CAAGGTTGTG TTCCTCCTGA TAGGGGACGG TGTCCTCCGA 5340 

TCCAGTGGAG AGGGTAnGCG CGTCCTCCGC TTGTTTCAGT CTAAGCGCGC GCTTGAGAGC 5400 

TTCAAACTCC GCCTCCTTGC GCCGGGnTTC CTCCGGCTTC CTTGCGGCGG GnTTTCTCCG 54 60 
(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 104 61 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 
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AAATCGTGCT GATACTGCAA CTTAGTAGCG ACAGGAGTAA TGAACCAATT TTGCACCAGC 60 

GAACCATAGA AAGTAGTGAT AAGCGCATTG CCATGTTAGA TCCCAGCGAG GACTTGTCTT 120 

CAAGCGTTGC AAGCATACCG ATAAGCCCCA TAACGGTGCC CAGCATACCA TATCCGGGCG 180 

CGAGcGCAgC CCAGGAGTTC AAAAGGGAAA TCCACGTATT GTGC CGATCC TCCATGTGCG 240 

TCAACTCGCT TTCCATCAGT GCCTTGATCG CATCTCCGTC CACACCGTCT ACCACGTTCC 300 

GCAAACCAGT GCGCACGAAn TCATCGTCAA AGTCCTGAAT TTCTTCTTCG AGCGCAAGTA 360 

AACCGGTGCG CCGACTTTTC TCAGCAAGCG CGTAGAGCCG CTGGACAATC TCCCGTTCGT 420 

GAAAATCCGC CGCATGAAAA ACGCGCGCAA TTACCCGAAA AACACCCACG GCATACGAAA 480 

GCGGATAGGT GAGAAAAAGC GTTAAGTACG AGCCCCCCAC GGTGATCAAC AATGACGGTA 540 

CGTGAAAGAG CCCCCTCGCA GAACCACCGA GCACCGCACC AAAGATAATG ATGGCAAAAC 600 

CGCCGAAAAG CCCGATAAAC GATGCGATGT CCATCGCTTC CCCCGTGTCT TAGGTCTCGT 660 

CGTTGAGGCA GCCGATGcTG CGCCGATAGG AGACAATTTT ATCGATAACT TCTTGCACAC 720 

TTTCCCTCAC CACATAGCAC TTACCCGACA GCATTTGAAG CGTTACATCA GGTGTACAAC 780 

GCATCGTTTC AATGTGGTGG GGATTTACCC AATTTTCATT TCCATTCAGT CGCGTCACTT 840 

TAATCATCCC TCATCCCCAT CACGCCACCT GCCGCTTAAG ATACATTTTC ACACAGTCGA 900 

CACATCAGCG CTTCAAACTC AACACCGTAT CCAACATGGT GTCTGATGTC TGAATCGTCT 960 

TTGCGCCCGC CTGAAACCCT TTTTGGGTAA TGATCATATC CGTAAATTGA TCGGTTAAAT 1020 

CTACGTTGCT CATCTCAAGT GTCCCTGCAA TC AACTTTC C CTTCCCCATC ACCCCCGACG 1080 

TGCTAATGTT CGCTATCCCT GaGTTGTTCG ATTGTACGTA GGTGTTCTCT CCTGCCTTCT 1140 

CAAGACCACC TTGATTTGCA AATCCTGCAA GTGCGAGCTG GCCAATGTCT TGGCTCACCC 1200 

CATTTGAATA CACACCAGTG ATGACACCGC TTTGATCTAT TTTAAAATTT TCCAAATATC 1260 

CCATCGCGTA ACCGTCCTGC CGGTAGCTTT GGTAGTACTG CGTTCAGCAA AcTGCGTAAT 1320 

CGTATTGCGC GCGGTGCCAA TTTCACCCAA GTTGAGCGTG AAAGCGTGGC GCGTAACCTG 1380 

cCCTGcATCG TCCGGaTTCG CACCGACAAC ATCGTACGAC GCTTCAAGGA GCACCTGTCC 1440 

GGTAGGACCG GTCACGTTCC CTGCAGTGTC AGTCACTGAA GCGAGGTGTC CAAAATTATC 1500 

AAAATTTACA ATAAAGGTGT TTGCCGCACC GTCAGATGTC CCCACCCCTA CACGCGTTTG 1560 

CGTATCTACC TCTGTCCCCG GATCCACTGC GACAGTGGCC TGCCACTGAT TGTTCGTCCC 1620 

CGGCACACGC GAAAAGTTAA TCTGCAACGT ATGC TGCTGC CCGAAGCTAT CATACACTTG 1680 

AAAGTCAGTT GTCCACGTGG ACTTACGCAC GTCCGCTTCG TTCGCATCTG CAGCAAGCTC 1740 
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AGGCAGACGC TTGTCTAAAT TACAGGCATA GTGAACAGTG CTGGTCTGTG CGCATCTATC 1800 

TTTTGCCCAA TGGGGATAAC GAGATCCTGC GTCTGTGCAG AGGAATTAAT TAAACGCTCC 18 60 

CCCGCCACGT CCTGCGCCAT CCAACCTTGA ACGCGCATAC CATTCGCAGG GTTCACGAGA 1920 

GTGCCCGCAT TATCAACCCC AAAGGC AC t G CGCGGGTGAA AAACGTCTTT TCCCCACTTT 1980 

TCAGCACAAA AAAACCACTC CCCTGAATAG ACACATCCGT ATTGATACCC GTCGTTTGCA 2040 

GTGCACCTTG CGTGTGAACA GTATCGATGC TTGCAATCAG CACGCCCAAT CCCACTTCCT 2100 

TGGGATTCAC TCCTCCAACT TCTTCATTCG GACGCGCAGC cGcACTCAGT TGCTGAGAAA 2160 

TAAGATCTTG AAAATTAACA CGCCCACGCT TAAAACCGGT AGTGTTAACG TTCGCGACGT 222 0 

TGTTCCCAAT GACATCCATG CGCGTTTGAT GATTCTGCAT ACCAGACACA CCTGAAAAAA 22 80 

GTGACCGCAT CATATGCTCT GTGTCCTCCT CATGTGACTC ATTTTCCTAA TCTCTCTTTC 2340 

TCTGTTCACA AACCATACAA CTGCTACGAC GCACTCGGAT CTGCAATCAC CTTGACGTGC 2400 

TCCCATTCGT ACCAGTGCGA CCCCACCCGC ACCTGGGGCT TGTCAGCACG GGTGACTGCA 2460 

CTGATAAGCC CACGAACAGT GTTATCCGCC TCAGTGACTT CAACCATTTT TCCCACCGCC 2520 

TGCAGCGCTT CAGTATTGCC AAACAGCGTT CCGAGCTTCT CTACCTGCGC ACTCATGTTG 2580 

GCCATCTGCT CGAGCGAGGA AAATTGCGCC ATTTGCGCAA TAAACTGCGT GTCCTGCATA 2640 

GGCGCAtAGG ATCC TGATGG GTAAGCTGCG CAATAAGGAG ATGCAAAAAA TCGTCCTTTC 2700 

CTAACTCCCG CTTCGCACTG CGCGCGCCTG CCTCAAGCTG CTTGTTTATA ACGCGCACAT 27 60 

CCATTTCTAA ACGC GTACGC TCAGCGGCGG TCATTTCAAA CCGCATATTA GTGTTCTGTA 2820 

CCATGCCCCG GCCCTCCTCT TTTTTCACCG GTTGTGTACG CTGACCCCCC TACAGGGTAG 2 880 

GCAAAACCGG CGCGCCTATT TAGGCAAACA CGTCAATCGT GAGCGCAnct CCcTGcGCAT 294 0 

GCCAATGGAC TTCTTGCACA ACAGGCTCAA CGCCCGCTCC CCCAAGACGC TGTGCAGCAG 3000 

cgTAGGCTGC CGTCTGCGAT GCCAAATGCC CGTCCTCTGC GTGCGCACCA GCGCCAAACC 3060 

ACTGCACATC AAACTGCGCA GskTCAAAAC CATTTGCCTC GAATGCACGC GCCAAATCCC 3120 

CCAGATTTTC CTGAAAAGCT TCAAACGCCT CCTGAGAAGC AACGTGAATA GTACCCACCA 3180 

CCCGCTTATT CTCCGACAGG GCAAGACGTA TGCTCACCGC ACCAAGGTGC TCTGGCTTCA 32 40 

GCGCAATGTC GATGTATCCG CGTCCGTGAT CGCGCAGCAC AACCCGTCCA GATTGCGCAA 3300 

GCTCTGCACT ATGgCACGAA TGTGCGccGA AAGAGCCGCC TGAGTGGTAG CAAATCCTCG 3360 

GATCCCCTGC GAnTGCGCTG TCTCCTCACG CGCGTGCGCA ACTCCTTCAA ACGCACGCTC 3420 

CACGCCTGCA CGCTCCCCCT CACGCAGCGA CTCgTACCGT GCTCTTCCGC CCCCGCATCC 3480 
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ACGTCCGCAG CCGaCAGtGC GCCTGCGCCG CAGGAAGTAC GGCGCCCGCG TGCGAAGACA 3540 

CACCCGACCC TGCACCGCGC GAATGCtACG CGCGTCCAGC ACAGTAAAGC GCGCGTCCGA 3 600 

AAAAGATCCC AACTCCTGAG G AG AATCCCC ATGCCGAATC TGCCCGTACG CAGTCGCCCC 3 660 

CTGGAAmGCC GCACCGGCGC TCACCCCTGC AGCAGCAGAG GCGTGAGACG CACCTGCCTT 372 0 

TCCTCCAAAG GAGCTGCCTG CTCCCCCAGA CTGCAAAACA GGCTCACCAC GGATAGCAGC 37 80 

CGCCACCAAC CGCTGCCGCA CCTCTGCATC GAAAATAACG TCAAAGACTT CAGACTCCGA 3 840 

CGATGCCGCG TACGTGGcTT CGCTCCCCTG cTCACGCAGC AGGGCAGGGG C AGCGC C AG A 3 900 

AGGAAGAGAC CCTCCGCGCA ACCCCCGCGC ACC AGCGC TT TCCCGCAGAT GCTCCCCAGA 3960 

CTCCTGCCCA CACAGGTCCT GCACGTGCGG TGCAGTCTCC GGCACACGCT GCGsTaCGCG 4020 

cgAGaCAGTC CTGCAGGACG CGCCGCACGC CCCCGTTCCC CGGATGAGGG CTGCTCTGGC 4080 

ACAACAGACC GCGGCGTTAC GGAAGTTGCC TGCCGCTCTA CTACAAGAAA CTCCGGCGCG 4140 

GACGGCCACG ACGCACCACC CGAAGCTTCC TGCGCCGCGC gnACGTAATC AAAGGGCCGC 4200 

ACTCCTGACG CCGCCTCCTG CGCAGcaCGc AAACCAGTCT CGTACGCCAC GAGAAATACA 4260 

TCCGAAAGAG ACTCTTCTGT GAAAACGTCC TGTTGGGTAC CCGAACCGGT TTGCTGCTCA 4320 

TCAGCTTCCT CGCGCACCCT TTCGTGCACT GCAGCCGACG CAGGGAAAGA GAGACTCTCT 4 3 80 

GTGGCGCAcG CCACGCATGT GTTCATGCAA CGACTGAGCA AAAGAACACG GTGc TGCCGC 4440 

ACACGACGTA CCGACTGAAA TAGTCTCCTG CGCAGCAGGC GcAGACTcCG CCACACCGAT 4500 

ACCAATGGCC CGTGCCAGCA GTCCTCTCAG TTCCATGCAC TCCCCCCACT CCCTGCCATT 4560 

CTCkGcGCAC TGCgCAGaCG TCTTAGAAAA AAGACCTGCC AGTCCGGTTC GGACGCATCT 4620 

TTTCAATACA AAGCGsACGA GAAATTCAgC ACCyTCCcAA AGGsTcCAGC ACGCCGCTTC 4 680 

TTTCCCGTTC AGTATTTCCC CGTTTCCCCT GACAAAAGAT GGATTCTCGG ATACTTTTCC 4740 

CCTCCGTCTA TGGAAAAGGA AAC AG CTGGC ATTCTCTGCT CCTATACGCT TTTTCACAGT 4 800 

GCGCTCGTGC TGGCGCTGTC CCTCGCGCAC GGGCGTACCC AGGTGCCCCC CAGCTCCACG 4860 

CTCAGCTTTT TAACGGTCAT TGTACTCTGG CACTGTCTGC TCTTCTTTTT TCTTGTCGCG 4920 

TATAGCAGAG AACCTGCAGA TACCACCGTG CCGTTTAAAC CGCTGCCTGA ACAGACAGCG 4980 

CCTATTTGTG CCGCCGCATC TTCTGACTGT AAGGAGAACC GCACCGCGCT GAAAACGCTG 5040 

AACACTGCAA CGCACATCAC GCTTATCCGT GCCAGTGCTA TTCCTATCGT TGGCTTTCTG 5100 

CTTAAATTCC ACGCACTGGC GGGGCTTTCT TACTTCCTCG TTGCAGGACT GAGCGTTTTG 5160 

TTCCTCACCG ATTTTATCGA TGGCAAAATT GCCCGCGCAA GACGAGAAAC GTCCCGCGTG 5220 
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GGAGAAACGC TCGACGCAGC AAGCGAC TAC GCGCTTATCG GGCTCATCTC AGCGCTTTAC 52 80 

TACCAAAGCG GTGTGGTGCC CCTGTGGTTC TTTGTGCTTA TCATCACCCG GCTTTCGTTA 5340 

CAAACGGTTA TTGCCTGTGT GTACGCGCTT TTTGGCCACC CGATGAcCGG TTCCACCGCG 5400 

GGGGGCAAAG CGACGGTGGC CGTGACTATG CTCCTGTACA CGCTCGAACT TGCCCGTCTC 5460 

CTGCTGC CG A ACCTTGCGCG ATCAAACAGC GGCGCGCGCT TTTTTACCGG GGCAGAAATC 5520 

TTGCAGGATT CGTCATTTTC ACCGGGATAG TGGAAAAACT GTATCTTGGC GTTCAGCATC 5580 

GCCCAGGACG CTCCCCGTAG GAGAGACGAT ACTTGCGCCG TGCCTTGCAA CACACAAAAC 5640 

CTGTACCAAC CGGGGCAAAA GGAGTGCACG CCCATGGATG AAGGAAGAGA AACTGTCCAG 5700 

CCtgcGCATC GCGCAAAGGA GGAAAAAAAA CAGGACGCCC ATCTTGCATG GGAGGTACGG 57 60 

AAACnGCACG ArGCGTGCgC CTGCGCGTTT TTCACGTGCA AGAACTCGAA AGCGTTTCAC 5820 

CGCGCAAAAC GGTACtCGCT TTGTAACGCT CACTGCACCT GAGTGGGTAA TCGTCGTGCC 5880 

GCACGTGATG GAACGCGCAC AACGC TTCTT CGTTATGGTk CGCCAGTGGC gCTGCGGTTC 5940 

ACAGACGGTG TGTACTGAAT TTCCCGGCGG GGTTATCGAC GCAGGGaGCA CCCTGAGGCT 6000 

GCAGCGCGCA GGaGCTGTTT GAAGAAACAG GCAGACGCGC TTCCTCTCTT GCACACCTTG 6060 

GCACCATACA CCCGAATCCC GCCGTGTTGG AGAACCGCGT GCACATCTTC AGCGCCGAGT 612 0 

GTACGCCTGA GnTACGTGAA CCGCAGTTGG ATACCGACGA GTTTTTAGAG CGGTGCGTGC 6180 

TCCCCGTGCA CGACGTGTAC GAACGCATGG GCCGCGCACC CTTTGACCAC GCGCTCATGG 6240 

cGCAGCCCTC TTTCTTTTTT TGCGGGCGCA TCCGCTTTCC TCCCTGTAAC TCAGTGCGGT 6300 

ACGTCcCTGC AGCGCGTC C A TCTAGGTCGG CATAGAGCGC CGCTC TAAAG GGGGGTATCA 63 60 

TCCCGGCTGC ATAc TCTGCA GCGCAGAGCG TGTTGTGCAG CAGCATCGCG ATAGTCATCG 642 0 

GTCCTACTCC CCCCGGAACA GGCGTGATCG CCTGCACCTT GTGCGCCaTG CGTCAAAATC 6480 

CACATCACCA CACAGTCTTC TCCCGCGCGG TGCAGTTGCA TCTGGCACGT GATGAATACC 6540 

CACATCGATA ACCACGGCGC CGGTGCGCAC AAACGGCGCG CCAATGAAGC GCGCCTTTCC 6600 

CAGTGCTGCA ACGAGGATAT CTGCCTGCAC ACAGATATCC GCCAAACCGC GCGTGTGACT 6660 

GTGACAGAGC GTCACGGTTG CATCACAGCC GGGAGAGGCA AGGAGCACTG CAAGCGGACG 6720 

GCCAACGATG GCAGAACGGC CGACAATTAC CACGCGTGCC CCCGCAAGCG GCACCTGCGC 6780 

ACGCCGGAGC AAGTGCACAA TCCCCGCAGG tGnCAGGGAA CAAACCCAGG CTGCGCAAGG 684 0 

AAGAGCGCAC CACAGTTAAG CGGATGAAAG CCGTCGACAT CTTTTTCTGG CGCCACTGCG 6900 

CGGCACACCC TCGC TGCGTC AAGATGCGCA GtAACGGCAA TTGGATCAAA ATGCCGTGCA 6960 
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CCCGCGCGTC CTCATTGAGA CGAGCAATAA GTTCTAACAC CTGTGCGTGA GAGGCATGAG 
CAGGCAGCCG GTGCGTTTCC CCCCGCAGTG GGCGCGAgCA GGGGCACGCT GCTTTGCTGC 
AACGTAGTAC AAGAAGCCGG GTCATCCCCC ACCAGCACTG CGGGCAAGAA AnGGCGCCGT 
GCCTAcCGCC GCACGCAGCG CCTGCACACG CGTTGCAAGA CGgCCGTaCA CTCGTGTGCG 
GCTTGTTTTC CATCGATGAg GCGTGCGTCC ACGcGCCCAg TATAGACACG CGCACGCAAC 
AGCGCAAAGA CCGAACACGC. ACGGACACTA GACGGAAGCC CAAGAAACAC CGTATGCTCG 
GCGTCGTATG AGCAGAACGT TCCGCGCGTG GCAGTGCGTT GGTGCGCTGT GTGCGCTCTC 
TCCCCTGCTG CCTGCCTACA rcTCCGAGGG CGTGCGAGAg GTACCCCCCT CCCAGTCTCC 
GCAGTGGTGG TGGCGTACGA GCCCATTCGC CCCGGGGATC AGCTGCTCAA AATTGGCATT 
GTTGCAGGCT GCCAGTTGTA CATAGCAGGG GGAAATGGAA CCAACGGCTC TTCGAGTTCC 
GGCACCAACG GTAACGGCAA CGGCAAACTG CTCGGGGGCG GGGGGTTTCA CCTCGGGTAC 
GAGTATTTTT TTACCAAAAA CTTTTCCCTC GGCGGGCAAG TTTCCTTTGA GTGTTACCGC 
ACGAC CGGGT CAAACTATTA CTTTTCTGTT CCCATCACGG TAAACCCCAC GTACACGTTT 
GCCGTAGGcG ctGGCGCATA CCGCTCTCCC TGGGCGTTGG GCTCAACATT CAGTCCTATC 
TCAGCAAGAA GGCGCCGGGG CTTATTGCGG AAGCCAGCGC GGGGCTCTAC TACCAGTACA 
CCCCGGACTG GTCCATCGGC GGCATTGTTG CCTACACGCA GCTTGGGGAC ATTGCAAGCT 
CCCCCGACAA GTGCAGAGCC GTGGGCCTTG CCACCATTGA CTTTGGGGTG CGCTATCACT 
TTTAGCCCCG CCGCCGGGGC AGGTGGCGCG CGCGTCCCTA CTGGATAATG GCTTCAAGCG 
CAATTTCTAT CATTTGGGTA AAGGAGCGCT CCCGCTCCTG CGCGCTAGTT ACCGCGCCGG 
TTACCAGGTG GTCAGAGATA GTCAGAATGC TCAGCGCCTC GCGTCTGAAc TTTGCAGCAA 
GCGTGTACAG CTCCGCCGTT TCCATTTCCA CCGCTAACAC CCCATACCGG GCCCACAGGC 
GCCAGCTTCC TGATTCATCG TAAAAGACGT CAGAGGAAAT TACATTCCCC ACCTGCACCC 
CCGTGCCCAT TTCATCAGCA ACCGACACTG CCGTGCGCAG GAGCGACCAG CTTGCCGTGG 
GCGCAAAGTG CATGCCGCTA AACtGCGCGC GTTTATTGCA GAATCCGTTG CCGCACCCAG 
CGCACACACC ACCGATTTGA GCGCCACTTC CTCCTGCAAT CCACCGGCAG TCCCCACGCG 
GATTGCCTTT TGCACCCCAT AATCTTGAAA CAGCTCCGTT ACGTAAATTG AGTGCGACGG 
CAGCCCCATA CCTGTCCCCT GCACCGACAC GCGCACCCCC TTGTAGGTTC CCGTAAACCC 
GAGCATGCCA CGCACCTCAT TGTAGCAATA CGCATTGTGA AAAAAACGTC CGCCACAAAA 
CGCGCACGCA GCGGGTCACC GGGCAACAGC ACGCGCGGCG CAATATCCTC TCCCTTTGCT 
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CCAAGgTGAA TACTCATCGT CACTCCCTCC CTTCGTGGGG CCTAGACCCA 
TAACCTCGCG CGTGTACGCT CAGCGCATGC AGCACCCGTT ACGCTTTTTG 
GCGTTTATAG ACGCCGCTGC ACGCCGCCCC TGCCCCATCG CACGAATAAC 
CCTAAGACAA TGTCTCCCCC AGCCCACACT CCCGGAATGC TCGTCCGTTG 
ACCACGATAG TACCCCGCTC GCTCACTGCA AGACTGCGCG TTGTCTTTGC 
TTTGAACCAT TCCCAACGGC AACGATCACC GCGTCTGCAG CAAGTTTACA 
CCGCAGGGCA GAAACACACG TTCTCCTGCA TCAATCTGTT CCTGACAATC 
ACCGCGCGCA CGTTCCCCTC TTCATCCCCC AAAATGCGGG TGGTCTGACA 
AACGTCACCC CCTCATCTTC TGCCTGTGCA ATTTCCTCCA CACAGGCGGT 
CGCGTTTTTC TGTACAGACA GTGCACCTGC TCAGCCCCTA AACGGAGCGC 
GAATCTACCG CCACATTCCC TCCACCGACT ACCACCACTG ACTTTGCCGC 
GTGTCCGCAT GCGCAGTGTC ATACGCCTTC ATCAGCGTCG CACGCGTTAG 
GCTGCAAACA CCCCGCACAA TTCCTCACCC TCAATATTCA TAAAGCGCGG 
CCGGTCCCGA TAAAAACTGC ATCAAAACCG TACTGCGAGA ACAGc TGTTC 
GTTC TGCCC A CCAAAAAGTT CATCCGGAAC GTcACCCCCA TTTTCTTGAG 
TCCGTCACTA CC AC TTCTTT CGGCAGGCGA AACTCAGGAA TACCATAGGT 
CCCGGTTTGT GGAGCGCTTC GAACACCGTT ACCGAATGGC CTGcACGCGC 
GCAACTGcAA GACCTGCAGG CCCTGACCCG ATGACGGCCA CTTTCTTGTG 
GCACAGTACG GAACTGTAAT TTGACCATGC TGCCGCTCCC AGTCAGCGAC 
AGCGCACCAA TCGACACCGC CTTGGACACA TCCTTAAACA TCTTTCCCAC 
AATTG AC ACT GACGCTCATG CGGGCACACA CGACCGCAAA TTGCAGGGAG 
GTCTTAATGA TATCAACTGC TTCCTTAAAG GCTCCCCTTT GGACACACGC 
GGAATCGGCA CTCCTACCGG ACAACCCTTT ACGCACGGCT TGGTTTTACA 
CGCTGAGACT CAACCAGTGC CTGCTGCTCT GTAAAACCCA GCGCCGCCTC 
AGCGACCGCT TTTTTGGCGG CAGCATACGC ATACGCTGCA AAGGGATCTG 
TTCATCTTCA GCTCTTTACC CTGGAGCTGC GCCAGGCGCT GGCACGCTTC 
AGTGCGTGCG GCCGATACGT ACGCGCTTCT GGTTCTGACT CAACCGGTAC 
TTGGCATCGC TTACGACATT TTGTACAGAT GTCATACCTA CCTCCCCGCG 
TCTTACAGCA GTGGACATCA TGCGCTTCCC TTGCCTGAAA TGCCCTCATT 



CAcGTTTCGG 
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TGCTCTCAAA ATCAACTTGA T 10461 
(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 13367 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 

CTTCGCGCGC ATCGACATCC TCAATACCTT TATGGACAAG GCAGATACAG ATTCTGACGC 60 

TTTCAGAGAA ATGTTCGACT ACTTTAACAC ATTTTTGCGT GCGTTTAGTG TCGTGGACGG 120 

CAATGTAATT GCGGCTTACT TGGTGGTAAC GCGTGTTTCC ACGGTGCTGC CTCACCTAAA 180 

TGCGTGTAGA CCCCATGGTT TTGCGGATTT GTACGCGCAT ATTGCGGATC CTCGATTGGT 240 

GTACACAGAG ATAAAGGATA AGGGCCTCAA GTGGGAATTC GTGAATAGTG TGAAAAACTT 300 

TGTGAGCAAT TGGAGCGATG AGTATGTCAA GCTGTTCCCC GAGGTGCTCT CTCTAGAGAT 360 

TCTTCGCGCG CTTATGGAAG AGGGATATAA GGAAAAGGCA CTGAGGGTGG TCGAGGCTTG 420 

CTTTGAATAC TATGCGGATA ATCGTGCGGC GGTTATTGGT TATTCAAGAC GGTAAnGGAT 480 

GAGCCTTGGT TCCAGGGAGC TGCGCATTAC CGCAGAACAG CGGATTATCG TCCTCATCCA 540 

CATTGTGGAC ATTACTTATC GGGAAATCGC TAACCGGCGG AACACCACTG AGAACCGAAA 600 

ACTTAACnAG CAGGCTCTTT CGGTACTCTT TGGGA t GATC ATTTGCyAgA ACACyTCCAt 660 

GCyTTCGCaC GATGTGGGAA CTACTACCCG TCTTTACACG TTATAAGTGA TATCCGGGGC 720 

TTGATCCAAA GTTAAAGGTC CTTTGCGCCA TAAATTATTG AGAAGTACAG GATTTTAAGT 780 

TTTTTGATAC TGAGGAACGT GTGGTTTCCG GACGTGGACT AGTGGTAACT GCAAAGATGC 840 

TCAATGCAAA AAAGAAAGAA TTGC aGGATT TGCTTGATGT TCGTATTCCG GAAAATTCTC 900 

GAGAGATTGG TAGGGCCTTA GAACTCGGTG ATTTGCGTGA GAACGCAGAG TATAAgGnTG 960 

CGCGAGAAGA ACAAACAAGG TTGAACAATA TGGTGACTCG GCTACAAGAG GAGATTGAGC 1020 

GGGCACAGGT ATTCGATCCT ACCACTGTTG TAGCTGGCAG AGTTTCGTTT GGTACGGTAA 1080 

TTAGCTTAAA AAATCACACA AGTGGAGAAG ATGAGACATA CACTATTCTT GGTCCGTGGG 1140 

AGTCGGCTC C AGAACGTGGT ATTATTTCGT AC ATGTCTC C GTTAGGTAGC AATCTGCTCA 1200 

ATCGTAAGAC AGGGGAACAA CTTGCCTTTA CGGTGGGAGA ACATGAAAAG GTGTATGAGA 1260 

TCTTAAGCAT CTCTGCTGCA GAGATCTAGT GAGGAAGTGT GCGATGCGAA TTATGCGGaG 1320 
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ATTAATGTTA TTTC TTATGT GTCTATGTgc TGCGCTGTTT GCGCAAGAGC TGGTTCGCGA 1380 

ACAGAGTGTT ACAAAGTCTG CAGATATTAC GGTGCTACTT GATAC GTCTG GCACTATTTT 1440 

ACCGTACCGT TCCGTGGTAA GCGGTAGTGT GCTAAAAGAT ATCGCTACTC GTTTTGTGCG 1500 

TTTGGGTGAT TCGTTCCATA TTATTTCGTT TAGTGC CACG CCACGTCACG AGATTTCTCA 1560 

GGTTATCCGT AGTGAGTTTG ATCTTTCTCA GGTAGTGTCT CGTTTCATGA TATTGCATCA 1620 

GTTGGGGTTA TATTCTGACT TTTTAACAGC GCTAGATTTC GCGCGTACAC ACTTaCGCGC 1680 

TTTGCCTGCA GCACATGAAA AAATTTTGAT TGTTGTGTCT GmCGGTATTT TTAACCCGCC 174 0 

TGCGCGTAGT TAgTgAAAAA CTACAACAAG GATCAGGTAA AAATTAACCT TGCACGGGCT 1800 

GCCGCGGATC TGAGACGAGA GCAGGTGCGT GTGTTTTACA TAAAACTTCC CTTTCCCCAG 1860 

GACATCCAGA TCCGCGATTT GGATGACAAT CTGCTGACTG ACCTACAAAA GACAGATGAT 1920 

GTTCAAATCT CTGCAGTCGG TAGCTTTGCA GAAGGACAAA CAAGAAGGCC TAAGTTGGAC 1980 

ACTGTGGGTG TGGTTTCCGA TCAAACGGGC GGCGTTGCAG ATAACCATGC AGTTGCTACG 2040 

CACGGAAGGG AGGACGGGAC AGTCCAAGGG GTTGTTGGCA GCCATGTGGA GGTGGCACGC 2100 

ACACAGGACA GACGCATAAT GCAGATCCTG CTAAAAGGGA AGGGGTTCGG CCTTCCTCAG 2160 

AAGCAACTGA TGTTTCCCGC GAGTTCACGG AGGATTTGGG AATCAGGGTG AGTC CGGTTG 222 0 

ATTCAGATGG TTCTGTGCGT TTTTCCGAGA AGGAGCGCAC GCTTc CCGTG TT AC AC TTTC 2280 

CAAGGGTCCT TGAGGTACAG GGTAAGTATG CAGAATGTAT GTTCGAGGTT GAAAATAGCA 2340 

CGGATGCTCC CGTTTTGTTG CATTgGAGCG GGTGATTTTT GACAATGGCG TTGAGACTGA 2400 

CATAGTTTCG GTGCAAACAG AGTCTTGTGC AGTAGCGTCC GGTGCACGCG CGATGTTGCG 2460 

AACAACTTTT TTATT AC CTA AGCGCTACCA CGAAGAGGGA ACGTACCAGG TGACCATGCG 2 520 

TGTACAGTTT GCAGATAACG TCCGCGTGTT CCCTCAGGTG GCAACAGCAG AGCTGCGCGT 2580 

TTCTCCTTTG CCTTTTCTTG GATTGGTGCG GAGAGGTATA CATGGGGTTC TGTCTTCTGT 2640 

AGGGCTTACG CATGCGTTTG GATATGTGTT GGACATGGTA GGGTTGAGTC GCACGGGTTT 2700 

CGGTGCGGTG CTTTTGCCTC TGTTTGCTTT GGCTATCTTC TTAGTACTTG TATCAGCCGT 27 60 

GGTGTGTAGG TCAAAGCGCG TGTTGTCTCG TAAGTCATGG CGCGGAAGTC CCCGTACAGA 2 820 

GAATGGGTGT CAGGGTCCTG GTTCGATGTC TGATTTTCGG GCGCATTCTG TTAAGGAACA 2 880 

AAGGCAGGAT CAGGAGCGCG TGTATGCAGG CATGGAGAGA ATTGTATCTC AGCGTAAAAG 2940 

CGATGTGCAG GATCGCCTCA GTGTATTGAA TGCGGCAACT GCATTTGGGC GTGATCGAGT 3000 

TTCATTTTCC CCCAGGGTAA CGCGTGCGGA gCATGGATGT AGTCGGTCAG GAATGACTGA 3060 
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AATTTTTGTG TTTGATCAAA 
AGGAACCCGT TTAGGGGTTG 
GTTTCCAAGG CGGCTAGCAC 
GAAGCCGAGG TACTTCCCGT 
GTTACCCTTG TCTCTGACAG 
CCCGCTGTGA GATTGAACAA 
AGGAACGAGT CGAGAGGTGG 
AAAACCGGGG CGGATCGCGC 
CGGGGTTTGG GGGATATTTC 
CGGTACCCTT TTGGGGCACA 
TCCTACGGCC GCACCCAGCC 
GGAGAGAGAG GGATTCGAAC 
TCCGATTCGA CCGCTCTCGC 
ACCCTCAGcG GGACAAGTCC 
TCCGCAATCT GACACTCTAT 
GCCCCCAAAC CGGAGCaGGG 
GCAGGGAGCC CGATTCGACC 
TACCGAATAC TCCCCGCGGA 
GTTTTCAAGA cCGTCGCCTT 
GTGTAAACGT TGCTCCTGTC 
AAAACTTCCC TTCTGGGGAG 
TTGTAGCTTG AGTAGGGGAG 
TGGTGCAGTT GCCCGATACG 
AGACTCCATT TCATAAACAG 
CTATTCAGAA CGCTGTGGTG 
TTATGTGCGT TCCCCGTGCA 
C CAGGCGG AG ATAAGAACAC 
TCCTCAGATG CCTGGTGAGG 
GGTGCGGCAA ACGCCGTGTT 
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CACGTGCGAT TQGCAAGCGC AATATTCACG 
GCGGGCACAA GGGGGATGAC TTCCTAATTT 
AAGTGTATTT TGACGGTGAA GTATATCATC 
ACGAGGAGTC GAGTGTGGTG CcrAcTGCGT 
GGGGTATCAT GTGCCCTTCA CATTCCGCCA 
TCTGCTCACC TCTATCGAAT ACGC TTGATC 
ATTCGGGACT CGATTGAGCA GATGGAAAGG 
ACCGAAGCGT CCCCCTACCG TTGCTTCCTT 
GTGGGCGGCC CCGGAACGGA GAGAGAGGGA 
CACGACTTCC AATCGTGTAC TTCGGCCACT 
GGTTCTGAGA AGGGGGTGCG ACGTTTCCTC 
CCTCGGCGCC CTTGCAAGAG CGCTACGGTT 
ATCTCTCCTC AACAACAACG GCAGAGCCCC 
CGTAATGAGA CTAGGCGGAT TCGAACCGTC 
CCAGCTGAGC TATAGTCTCA AGGGAGTGGG 
GGGATTCGAA CCCCCGGCAC TCGGATGAAT 
ACTCTCGCAC CGCTCCAAAA AACAGCAAAC 
GCAGGGGGGA TTCGAACCCC CGGTGCCTTG 
CAACCACTCG GCCACCACTC CGGACGCCCT 
AAGTCTTTGT ACGAGCAGCA TAAAAAAGTG 
AAGCTCTTAG AGAAGTAGCG TTTTTATGTT 
TATATGGACG ATGCAAGATA TGCAGAATGG 
CATTTTTTTG ATCTTATGCG CCTCTATTTG 
AGGCTTGTTC AACAACTTAG TGCCTTCCTG 
CAGATGCTTG ATGAACTCGA CTTGTTATTT 
ACGCTCGAGC TGCTGACAAT TTTTTTTTAG 
GTCTACTGAA TTTAGAAGAA CGTCTTATTC 
TTACACAGGC AGAAGTCGCG AGCGTTGCGC 
ATGGTATCAA TCCTCTGTTG CAAAAAGCAT 



PCX 
TAATGAAAGC 
TTTTGGTGCC 
TTGCTATCTT 
CGGCAGAGTG 
GTATGAGGAT 
AAAGCGATAA 
GGGAAAGATG 
CCGGACCTAG 
TTCGAACCCT 
CGGACATCTC 
AGCCAACAAC 
TTCGAGACCG 
ACAGGACACC 
GACCTTCAGA 
ATGCCAACCG 
GCAACTCtTA 
AG ACGC AC c G 
CGACACAGCG 
TCCATCCTGC 
GTACGTGTAG 
ACGCTCCCCC 
AGTGCATCTT 
GGTGTGCTTA 

CAAAGAAAGT 
ATTTCTGTTG 

AACGTGTTGC 
TTTACCGCAT 
AGAATGGTAG 
gAGTACGGTA 
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3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
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GCTGGACTCA ATCTTTTTCT CATTCCGCAA AAGCGGATGC GTCCATCCGC ACAGTTATTG 4860 

ACAACAGATT TGATGCTGTG CGCATTGTAT tCGTTTTTTA CGCACGGGgA AAATT T ATT A 4920 

AAAGTCGA t G GGACGTTTAG GAAAAAGGCA TT t GTTATGT TCCAGGCATT GTTTCCtGTT 4980 

GATCCGGATG TGGTGAGTGT GGCACTCCCT GCATATCTGC AGAGAGCAGG GGAGGAAAGG 5040 

GGTACATCAC GTCTTTTACA GGAAGGTCGG CGCGTCTTGG AACATCTGGG ATTGATTGTC 5100 

TGCGAATCAG CACAGGTGCA TGTGCAAGAT AAACGGTGGG CTTCTTTTTT CTCCTTAACT 5160 

GCTCTGGAAC GTGCGGTGTA TTTGACAGTT GCCAGTACGG CTATTCTGCG CAAAGAGGTG 5220 

CTCGTACAGC GAgCGCAGGC TTTGCGTACA CTTCTCTGTG TGTTGCACCC AGATGCGCAA 5280 

TACGCACCTG AAGATCTAAC ACGCGTGTAT CGTATCTTGG TGGAAGAGGC AGCACCATCT 5340 

GTTGCTGCTG ATTTTTTCTC TTCTTTGTCT TTGTCCAAAG ATACAATGCT GCAAAAGCGT 5400 

AAAGGAGCTT TACATGATTC ATCGGTTTTT TCTATGCAGT CGGCGATCAC GGCTATACGC 5460 

ACGGCCCAGC TTTTTGGGTT GTTGTGTGTG AAAGATGGAC TGTGCGCGTT GAATGAGGCT 5520 

CTATTTAAAG GACAGTACAC GCGTGGGCCA GGAATGGTCT TGTCAGCGAC GGCAGAGTTA 5580 

ACCATTTTCC CCGATGGAGA TATGCAaGGG GTTTTGCCAA TTTTATCCTG TGCGCATGTC 5640 

TGCTCACTAC AAACAGTTGC CACGTTTGAG CTCAATAAAA AAAGCTGTAC CACTGGCTTT 5700 

GCGCGCGGAT TAACAGTGCA GGCACTTGCA CAGGCTTTAG AATGTAAAAC AGGTGAGCAG 5760 

GTGCCACAGA ATATACTATC TTCTTTCCGG CAGTGGTATG CaCAGATAAC CGCGTTGAcC 5820 

TTAAGACGCg GCTTTGTCAT GCAGGTTGAT TCATCTCAGC AAGCTTTTTT TGAATCTGGC 5880 

GGGCCACTGC ACCCGCTAGT GCGCACGCGT CTTGCAGAAG GAGTGTACTT TTTTGATGAA 5940 

TGCCAAGAGT GTATGTTGTA TCaGGCcTCG CGCGAGCGCG TCTGTCCTAC CTGTGCGAGC 6000 

CAATTGATAC AGCCACCCCG TTATTCCGCC CTGGTGAGCA GGGTGCACGT GCGCTCCATG 6060 

TGCCTTCCTT TTCTTTTCCA GTGCGGTCTG CTCGGGGAGT CTCCGAGGAA TCAACGCGAG 6120 

ATTTTGCACA TTTAGGTGCC TTTGTGTTGG AAACTCCGAA CGTTTCGTGC ACGCACAGTG 6180 

CTGCAGATAC TCCGTCTATT TCAGAACAGA CCGGTGGGGT GGCTCACGTG CAGAGCGAAG 6240 

AGGATGTAGA TCCGTCCACG TCTGGTGCAA CGGGTAAGTA TTGGGACAAG GCACAATGGC 6300 

GCaAGGTGCa ACGGATGCGA CGTGCTGTGC GGCTGCAGCG GCTCAAAGAG TTTGAGGCGC 6360 

ACCTGCAACA ACTAAAATTG GACGCAACAG AGCAGACGGA GCTACGTGCC CGCTTGCAAC 6420 

GGGGGTTGAT TCTGGATAGA ATGCAACTTT CGTCCGAAAC GATCCGCaGG GAGAGAACGG 6480 

AAGCGAGCGG GGTTGATTTT TTAGGCAAGT ATCGTCTTGC aGAGTGTGCG TTACGTTCTG 6540 
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GTGCTTTACT TGAGATTGAG ACTAGTTCAG GGCAGTCAGT GCATAAGATA GTGGG TACGG 6600 

TGTGCGCAAT TGAAAAATGC GAAGAGGATG CGTTGCTTCA CGTGTGTGTA CACGCAGAAC 6 6 60 

TTCCCCCTGA GCGAGTATCG ATTGCGCGCG CGTCCAGGAT AGTGCTACTG AAAAATTCTA 6720 

TTTTTTCTTG AGTCTGTTCT GAAGGGGATC CTTTTGTCTC TTGTAAAAAG GAATAGACGA 6780 

GCGGGTAGGA TATGAGTCGT AGGAAACAGG GACGAGAGTT ATTCAACAGT CATGTGGGCG 6840 

TGGTGTTGTC TTGTGTCGGT GCGGCAATGG GGCTTGCAAA CGTGTGGTTG TTCCCTGGAC 6900 

GCCTGGTGGA ATTTGGTGGT GTGACGTTTT TAATTCCGTA TTTTATTTTT CTATTTGGTC 69 60 

TTTCCCGTTT TGGACTGATG GGGGAGTATG CTTTTGGAAA G AC AC TGCGC TGCGGTCCTG 7020 

TGCGTGCGTT TACCCGTGTG TGTGAAACAC ATtCCATCGT GTTTTTTACG AGCACTACGA 7 080 

GGTAGCGGGT GGTTTCCGGT AGGAGTATTG CTCGCTACCT GCTCTTTTTA TGTAGTGATT 7140 

ATAGGGTGGA TCTTGCGTTA TGTAGTATTT TCGTGCACGA ATGC AC TTGC AGGTACTCAG 7200 

GCGCACGACC TGTTTTACCA GGTTGCAGGG ACAAGTGCGA ATGTGCCGTG GACGCTTGCA 72 60 

GCTATCGCGC TCACAGCGTG TGTAGTGAGT GCGGGCGTGC AAAAGGGGGT GGAGCGAGGA 7320 

AACATTATAA TGATGGTACT TTTTTACGGT GTCCTTGCGT TTATTACAGG ATATATATTT 73 80 

ACTCTTCCTA ACGCGTGGAT AGGTATGCGT AGAATGTTGG CATTTCAATC TTCATCATTG 7440 

TGCAATCCGA GACTCTGGTT GTATGCATTA GGCATGTCGT TTTTTAGTCT CAGTTTGGGG 7500 

GGCGCGGCTA TGGTTTTATA TGGCAGTTAC ATGCCAGATA CGGTGGACAT ACCGCGTACT 7560 

GCATTTCAGA CAGCGACCTT AGATTTTTTG GCATCAGGTA TGTCCGCATT ATGTTTAATT 7620 

CCGAGTGCGT GGGTTTTAGG TATGGACGTC AGCAGTGGAC CGGAGTTTTT GTTTGTAACA 7680 

ATAACCCGTG TCGCCTCGCA GATACCGATG GGGGTGATGA TAAGTGTGnT AwTCtTTTTG 7740 

TGTGTACTAT GTGCAGCGTT AAgTTCTGCA ATTGCTATGT TAGAAGTAAT ACTCGAGTCT 7 800 

TTTGTGCACA CGTGTACAGT GGGGCGCCGA ACGCTGACGT GGTCACTAGC ACTCGTGGTT 7 860 

GCGTTTGTAT CTCTTCCTCT GAATGCCTCG ATGAGAGTGT TCGAAACGTT TACAGATATA 7920 

GTGGTGGTTA TACTATCTCC GTTATCTGCC CTTATGGGGA GCGTGATGAT ATTTTGGGTA 7 980 

TATGGTGCAG AGCGTTGCCG TGTAGCTATC AACCGGTGTG CACGCGGTCC GTTGGGTAAA 8040 

TGGTTCACGC CGTATATGCG GTACGTGTAT TTGGGGCTTT GTGTAATGAT TATGGTGCTT 8100 

GGGGTAATGT TCGGTGGTTT TTAGTGTGAT GACGCGCAAA AGCGGCCAAA CCCACAGTTG 8160 

GGTAAATATA TTCTTTGCAA ATTGTCGACA CAACCGTTGA CGAGAGGGAT CGCAGGTGGA 8220 

GAGGTGTCGT GCCGGGGATA TGTTGACACG TCCGTACTCC TCAGTTTGTG AGGCTCCAGT 8280 
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TATAGGAGGG GGGATAgCTA CGCGTGAAAA GATTTGCTCT TATTGGACTT GGAGACTTCG 834 0 

GTCTTAGCAT GCTAAAGGAG CTGCTCAAGC TCACTAACAA TATAGTCCTC CTGGACAGGG 8400 

ATCGAACGCT CGTTGAAACC TACCGTAGCa GGGTGAGAAT CGTGCGCGCA ATTGAtGTGT 84 60 

TGGACGAATT CACTCTGTGC AAGATGATTC CACAGGaTAT CAACGCAGCG GTTATTGATC 8520 

TGGGGGTTAA AATTGAATCA TCAATCATGA TAACAACGTT TTTAAAAAAA TTAGAAATTG 8580 

CAGATATCGT AGTTAAGGCA TACAGCGCTG aACAAGGGCa TATCCtCTCG aGCGTTGGTG 8 640 

yTACGCACGT AGTkCTCCcG GACCGGGAGg CAGCTAAAAA AGTCACTCCT ATGATTGCTT 8700 

TCGATCTTCT TTTCAACTTT ATGCCACTTT CTGCGCAgCT GGnCAATTGC GGAAATGGCT 87 60 

GTGCACGAGG ACTATGTGGG AAGAACTTTG CGTGAAGTGG ATGTGCGCAA AAACTTCTCT 8820 

CTTAATATCA TTGCTATCCG TAAGCGCGAT GCAGAGGATT TTTGTTTTAT CAATGATCCT 8880 

GAATACTGCT TTGAAGCGAA CGATGTGTTG CTCGTTGCCG GTTCTCACAA AGACATCTAT 8940 

GCACAGTCGC AGGACAAGCT GGCACATACC CATAGCTTCA GCGACTTTTT CAAACAATGG 9000 

TTCCTTACCA GCTGACTTCC CAATGTTCCG CGCACGGGAG TAGGCGCGTG TAATCTTCCC 9060 

TTTTCCCGCA CATGCCTACG TAAAGGGGAA TATTTAGAGA GGGGGCTCAG CTTCAAGTTT 9120 

TGAAAAATAA GGCTCAAGCG TTGCCGCTTC CCGAATTGAG GTTGCAGTGC TTACCACCGC 9180 

AGCTTCAGCA CACGTGCGcT GCGCCCAGAG TAGTTGGGTA CACAAGTGCA TGTCTGTTAC 9240 

CCGTGCGTCT AGAATATGCT CAATAGCCGC AATACGCTCT GTTTCTGTCG TTC CGTTGAG 9300 

AAAGCGCAGG AAACTCGCAA AACTTTTATG CTCCGGTGTT TCAGGAGTTA CCGCCGTACT 9360 

GTACGCTCCT ATGATTAGAC GTTCCATCGT CTTCTGGTTA AAGAGACTGG AGGTGCCCTT 9420 

GTGATCGGCG TGCAGGTGCT CAATCGTCTT GAATATCACG TCAAGGGAGT GGAGCGG a TT 9480 

TGGATCGCGG TAAGTTAGTG AGGAAAATAT GCCATGTACT GGATCTGGCA GGGTGAATGC 9540 

ACCGTAAGCA CCACCTATCG TTCGAATTTT TTCCCAAAAC GGCTCAGTAC TTAGATATCG 9600 

GGCAAACACC TGCTCTACCC CGCGTCTCTC CAAAGGAAGC CGTGGATGTG CAAGGGACAG 9660 

CGCTGCAAAA CCCACTTGCA CAGGGGCTGG AAGCAGCGTC AnCATATTGC GGGTGCGCAT 9720 

GTGCTGCAGG GCCTCTTGAA AGAGCACACC GTGAGCGGAG GGTATCTGTT GTTCGTGCGC 9780 

TGTGGGCGCA GAGGTGTGGT GGATAAAATA AGTGGATAGC GGTGCACGAA AACACGCCAG 9840 

AGGTTTTGCC AATGCATCTA GCGCTGTGTG TAACGATGTT TCTGTACCAC ATACACATCC 9900 

GATCACTCCT GCAGTGAGCA GTTTTTCATG CAGCGCTTTG AGTTTGGCTG CCAGCGAAGG 9960 

GGAGGCAACG GTTTCTGTAC ACTCTGTCCA CAAAGCACGT ACCAAGCGAA TCTGCGTGAC 10020 
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TCCAGTCCAG AGTTCTTCTA CGGCCTTTGC TGCATTCACG CGAGCGTTTG CCTTTGCAAG 10080 

TGCAATGGAA TGTCCTGAAT GCATGGCAGC GCTATCCAAG TCATTTTTAT ATTGTGCAAG 10140 

GATGTCTTTT AACCGTCGTG TATCTGTAAA AGAAAGACTG CGCACGTGTG CACACACGTA 10200 

CGAAATCGCC TGTACGATAA AGCGCGACAG CATTTTTACG CTGACAACTA GCCACGCTCG 10260 

CCCGACTATA TCGCTGCGTT GCAGTGTGTT CTGTCCCCTC AGTAAGGGGA GTATCTCGCT 10320 

TCCCTGATCG CCTGCTACTA TACAGCGGGC GGCAAAGCCC CCGGTGAGAC GGGCAATCTC 10380 

GGCAGATACC ACACTCCAAT GATGTGTCTC GGTTCCCATA CCTGTCAGTG CGTAGCCATA 10440 

CAGTGGCAGA AG T TGTGCTT CTTTTACACT GAGCATATCT GCTGGGATTG CAAGGTGTAA 10500 

GTACGTAATA TCGTTCGTGG CAAGCTCATG CACGAGAACA GGAACAGAAC CAAAAAACTG 10560 

CATGGTTTCG CTCAGTTCTG GGGTGGGGAC TGGCAGTTGT TCCCGCTTTA TATGGGGGAG 10620 

CAACGCAAGG AGTTCCTCCG GATCGGGTGT TGTCTGTCGT ACACGCAGTG ATTCTTGGTC 10680 

AGCTCGGAGG CGCGCCGCTG CTGGCTGCGT GAGTGTACGG GAGAAATCCT GTACGTATTT 10740 

TTCTAATTGC TCATCAAGTT TTTTTGAGAA GTCTGGGTCT GGGTGTACCG AAAGTACCGT 10800 

GTACTGCGGG TTGCGCAgcA AG tGCGTGAG GATGAGATTP TCCACGTAGT GtGgATGGTG 10860 

GTGTACCTTT TCACGCAGgc CTGCAGTGCG GGGATATAAC GCAAAGAACT TTCTGGACCT 10920 

GCACCGTGCA ACCATCCACG CAGCGAACGC TGCATGAGCA CGAGAGAAAA AGGACCGTCA 10980 

GAGCGGCGTA CTTCAGTATT TGAAAATTCG AGTGCATTCA GCGCTGTTTC CACTTCCTGT 11040 

GGAGGGATGC CGTGCGCAAC AAGCGACTCT AGTGTTTCAA ACACGCATGC CTTTAGTGCA 11100 

TCGACCTGTG TATGCTGCAC CCCAGTCATA CCTACAAAAA AAAGCATACG CTTTAGATCG 11160 

ATGTGACTGC CGTTATATGC GTATAAATCC TCACCGAGTT CTGATTCTAA CAGTGCCTGT 11220 

GCAAGGGGAG CAGCATCGTG ACCGAGCAAA ACGTGTTCGA GCAAAAACAC GTCCATTAAC 11280 

TGTTCAGCCT TGTCTGATTC TGGGAGTAAC CAGCTGAGCA ATACGGCGCA CCGTGTTAAA 11340 

TCCATCCCCT CGCTCGCCGG TGCGTACCCG GTGTACGTAC GGGGACTTTG GTATGCAGGG 11400 

ATAGGGGGGA TGGGGGGCAA CGCTTTGCGG GCAGAAAATT TTGAAAGGCA TTTATCCTCA 11460 

ATAAATGCCA TCTGTTTTTC GGTGGGTATA TTTCCGTACA GAAAAAGCTT GCAGTTTGAC 11520 

GGGTGATAGT GTTTTTTGTG AAAAGCTTTA AACGATTCGT ACGTGAGACG AGGAATAACT 11580 

GTTGGATGAC CTCCTGAATC GTGTGCATAC ACTGAGCCAC GTGTGGTCGC GTGTGTTGCG 11640 

TGCTTATACA CAAGCGTATG AAAGTCTGCA TACACACCGC GCATTTCATT CAGTACAACG 11700 

CCCTGGAGGG TAAGTTGGTT GTGCTCATTA AACTCAAAGC GGTGTCCTTC TTGCTTAAAG 11760 
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GTCCACTCTT CGATCAGGGG GAAAAAGACT GCGTCTGCAT ATACACTCAT 
TAGTCAGTCT CTACCAAGGA GGAGGCCGGA TATACTGTTT TGTCCGGAAA 
TTAAGAAACG TTTTCACGCT TTGTTTCGCG AGTATGAGGA ACGGATCCTT 
TGCTGTGATC CACAGAGCAC CGAATGCTCA AGGATATGAG CAACCCCGGT 
TCTGCCGTCA TAAAACAGAA GGCAAACAAA TTCTCCGGGT CTTCG TTGAG 
AACTCAAGCC CTGTTTTTTT GTGTCGAGCA TAGACACCCA CTGCCGAAAG 
GAATGGCGCC AGATAATTTC AAAACCGTGA AGAAGCGTAC TCATCGGTGA 
TCTTCTTGCA AGCTATTTGG AAGCAATGTG CTGTTGCGCG CCGGCACTGC 
TAAAAAGTGC TCAGTGATGG TGCGCGTATC ATGCGCTGAA GGCGGGAACG 
ATCnCGCACA AGCTGTGCGC GGTTTTTAGA GAGATTAGAG AGAATTTTCT 
AGGATGATTG GTG TTGATGA GCGCAGCAAG AGTTTTTTCA GAACAAGACG 
TTGCAAAAAT GTATCTGGGA GCGCGGGGAT ATCATCCAGC GTGAAAAGAT 
ACGCGCTGCA AGTGTTGGAT TTTTTTCTGC AAGGGCATGA AGAATTGAAT 
GCGCTCCATC TTTTTGAGAA TTGCCGCAAG CACTGCATGC CCGTCAAGAT 
GGACAAATGG AGCGCTGCAA ACTTTTTGTG CAAGGAGTCA CTCATGACTT 
AGGG TTAACG TGC TTTAACT TTGCAAGGCG AACGATCAAG TCCTTCTTCT 
GATATTACTC AAATAGTGCG CAGCGCTTTC TGGAGGCAGC TGCGAGAGGA 
GGTGGCAGGT AGTTCTCcTT CCAGGAGGGG GAGAAGTTGG GAGGCTTCAA 
AAACTCAAAA GGTTTCgGCT tGCCGCTGGC ACCGCCCGCT TCAAGATAAG 
TCTTCCCCAA ACGCTTTGGA AAGCATCGAc TGCGCAGCAC GCAGTCCACC 
GACACACGAG CGCAGAGGGC AG AAAAC TCC CGTAGGATCT CACGCGCTTC 
AGGGGTTTGA GTGTCAGGAG CTCGGCAACC ACCGCCTCAA TCTGTGCAGG 
TTGAGCACCA GCGCCGCCTG CTCTTCTCCA ATGAGGGAGA GGAACTGGGC 
TAAACGGTTC GGCCTCGGTC TTGTTCACGT ACGGTGGCTT TGATTAAGCC 
TTCGGTTCTA TTnCATAGCA AAGAGGACTC CGCGCGGTCT CCCGGCACAC 
TGTAGTGGAG GGTGTGCTCT TGACACAAGG GCGTGCAnAC CTTAAAAGGT 
CAGACGGGGT AGGGGTCCAA GGATGTGATG GCGTTGTCTT TCGGTTn 
(2) INFORMATION FOR SEQ ID NO: 56: 



PCfl^PB/13041 

AACATTGAAG 11820 

GGTTAGAGCG 11880 

GAGGGGATAA 11940 

ACTTGCTTCT 12000 

AATGTGGTAC 12060 

CTCAGCGAGT 12120 

TTCTCACTCC 12180 

GCAATGTAGC 12240 

TGTCGTACTC 12300 

GAACAAAAGC 12360 

CCAGGTGTTT 12420 

GCGTGCGG AC 12480 

GCTCAGTCGC 12540 

CACGGCGCTG 12600 

GCAGCACCTG 12 660 

CCTCTGTGCT 12720 

TGAGTGTTTT 12780 

GCGCAGCCAA 12840 

ATCGGCCTTT 12900 

GGTAACAGGC 12960 

TTCTGGACTG 13 020 

CTCAAGTTGC 13080 

AATCTTTTTA 13140 

ACGnAGGAGA 13200 

GC AGTTGCAT 13260 

GTCCCCCCCC 13320 
13367 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 6856 base pairs 
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<B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 

GCATTGcTGC GTCTCGATAG GCTGTTCGGT ATCAGCTGCG ATGATGAGGT GACCGGTCAG 60 

TATCACTATG TGGTTATAGT TGGTGCGGCA GAGAAAAAGG TGGGGCTCAT GGTGGATGCG 120 

CTGATTGGTG AGGAGGACGT ATCATCAAGC CACTGCGGGA TCAATTCACT AGTTCCCCTG 180 

GTATTGCAGG GGCATCTATC CTGGGTGACG GTTCGGTGTC GTTGATTATC GATGTGGGGC 240 

AGC TGCTTGA GCTTGGGTTG AAGCGGGAAA TATTGGCGCG TGAgcgTcGA GAAGCCACGG 300 

TGTGGTAGGC GATCTGGGGC ACGGATTGGG GACTATGATA GAGCATATGG AAGCAGAGAT 3 60 

CGGCATTCGG GAAAGTTTCG ACGGGGGCGT ACGTGAGCCG CTTGCGgTCA TAGACTTCAA 420 

GATGGTTACC TTTTCCCTCG CGGGGAAGGA CTACGCGGTA GATATCATGC AGGTGAAGGA 4 80 

AATTGCAAAG GCTGGGAGCT TTACCTATGT GCCCAATACG TCTCCGTTTG TTCTGGGGGT 540 

GTATAACTTA CGGGGGGATA TTATTCCCAT AATTGATTTA AGGAGATTTT TTAATATTCC 600 

CGCTCCGCGC AAGTCCCGGC AGGCGATCGA GAATATGGTG ATCGTCACAG TGGAAGATCA 660 

GACATTCGGG GTTGTAGTAG ATGGCATCGA TAAGGTAATT GGGGTGTCAA AAACAACTAT 72 0 

TCAGCCGCCA CACCCTATCT TTGGGGACAT CAACATAAAG TATATCCGGG GGGTGGTTGA 780 

GGAGGCGGGA AAGCTGTACA TCC TACTTGA TGTGCACCGG ATTTTTTCCT TCCGTCTTGG 840 

GGAGGAGGAA CGGACGGCAG TTGTCGATCG TGGTGTTGTG CCGTCTCCTT CACCTCCTGC 900 

CGTATCTGTG CCGCCGGGGG ATGAAGAAAA TTTAAATGTT GGTTTCATTA GCGATACGTT 9 60 

GGCCGCGTTT GGCCGTTTCT TTACCAGTGC AGTGAATGAG GGTTGGTTGC GCAgCCGGTA 1020 

TCTTGTGTGG CGTGACGTGC GCTCTGGAGC TGAGGTACAG CTTCAGCATG AGGAGGATGT 1080 

CGCCGAGTTC TTGAGTACAT TTCCTTCCCC GGACACAGGT GTGTTTTGGT CGGGGGAGTA 1140 

TGCGGCGAGT GTGGGATCTG TTCTTTCTCG GATGCAGGTG GGAAAGGTGG TGACGGTGTG 1200 

GAATATCGGT TGCGGTGCGG GTCACGAAAG TTACAGTCTT GCGGTGCTTC TCAGAAAAAC 1260 

CTTCCCCGAC GCGGTGGTTC GGGTGCACGC AAGCGATTCG GATCTCTTCT CCATTTCCAA 1320 

TGCTCCCATG cTCACTGTTC CTGAgCATGT GATCGGTGAT TGGTATAAGC CCTATGTGGT 13 80 

GAAGGGGGTG AGTGGTTCAT ACACCTTCTC CCAGGAAATT AAGGAGATGG TCCTGTTTGA 1440 

GTACCACGAT TGTACGCATC CGAGTGCGCT TCCAGACGTC GATCTTATCG TGGCGCGGGA 1500 



Printed from Mimosa 02/03/22 07:25:47 Page: 507 



WO 98/59034 . 5Q6 r ^ 1 '^^>' 13041 

CGTACTGTCA TCTCTTGCGG TTCCAGTGCA GCACACCCTG TTGAAGGAGT TTTCTGAGAA 1560 

GTTGAAGGCA ACAGGAGTTG TTCTGCTCGG TCAGAACGAG GTGATGCCTA AGGATACAGG 1620 

ATGGTTGCGG CAGATTGAAG GCACCGTTGC GGTGTTCAGC AAGGAATAAT TAGCGCATGA 1680 

GGAGTGGTGT ATGCGTGTAG AGTATATCAA CCCGTTCAGT GAGGCGGCGT ACGTGGTTCT 1740 

GTCTGAGGTT TTAGCAGGGG AAACCAAGCG GGGGGACTTG TATTTGAAGT CTACGTGCAT 1800 

GCCGGTGATG GGTGTTGCGG CTATCGTTGG CCTTGCAGGG GATGTAGAGG GGCGTGTGGT 1860 

ATTTGACATG ACGCTCGATA CGGCGCTGAA GATTGCCTCT TCGATGAACG AGGAGAAGTT 1920 

AGCGGCGTTT GATGAGcTTG CGCGTGCGAC GATCACCGAG CTCGCCAATC TGATCACCGC 1980 

AAAGGCGGTT ACTACGTTGC ACGAGCTCGG ATTTAAGTTC GATCTTACCC CTCCGGCGCT 2040 

GTTTACTGGG GACAACATGG AAATATCTAG TAGTGATATT GAAgCGCTTA TCGTGCCCAT 2100 

GGAGAC GCCT CAGGGTAAGG TGGAAATTAA TGTTGCCATC CGCGACAAAG TATAAGAGGG 2160 

AGGAAGTATG ATTTCCAAGC AGGATTTTCC CACGATCAAC GATCGGGTTC CCGCAGaCaA 222 0 

AAACCGAATG GGGCGCCCTA TCGTGTGTTG GTGGTGGACG ACTCCATGTT CGTTTCAAAG 2280 

CAGATTGGTC AAATCTTGAC AAGTGAAGGC TACGAGGTTG C AG AT AC TG C GGTGGACGGC 2340 

GTTGATGGGG TTGAAAAGTA TAAGGCGATG AGTCCGGGCG TTGATTTGGT GACGATGGAT 2400 

ATCACGATGC CCAAGATGGA CGGGATTACt GCGCTTGAGA AGATTCTTGA GTTTGATAAG 2460 

AATGCAAAGG TAGTTATCAT TTCGGCGTTG GGGAAAGAGG AATTGGTGAA GAAGGCACTG 252 0 

TTACTGGGCG CGAAGAACTA TATTGTCAAG CCGC TCGATA GGAAAAAGGT GTTGGAGCGA 2580 

ATTGCAAGCG TAC TAAAGTG AGGGCGGATG TGTCCTGCGG GCTGTCTCGT ACGGTTTGCC 2 64 0 

CgCTTGCGTG TGTGGATGGT TTCTTGAGGT TTTTGCCTTC GCGCGCGGAG TGCCCGTCTC 2700 

TCTGCGTGCG TGTTTTCTGT GTGTG TGCCG CAAGAGGAGA AgTGGTGTCT CCCTCAGCCC 2760 

TTTGCTCGGG GCCCTGTGGT GCCTTTCCTG CGGTGTAGTT TCTATACTCC TTCGTAGTTC 2820 

CTAGTTGGTT TGGTTGGAAA GGGTTCGGTT CGATTTTGAA GAGGTGCACA CGTTGTATTG 2880 

TGCGCATGAA AGAGGGAGCG GTGTGTGCTC TTCCTGAAAA CGCTTGAGGT ATTTGGCTTT 2940 

AAGTCGTTTG CAGATCGCGT TCGCGTTGAG TTTGCAGATG GCGTCACTGC GCTGTTGGGC 3000 

CCAAACGGCT GTGGCAAAAG CAATGTCGTT GACGCCATAA AGTGGGTCCT CGGAGAGCAG 3060 

TCCTCTAGGG CCTTGCGTGC CGACAGAATG GAAGACGTTA TATTCAACGG GACCGAGTCG 3120 

CGTCGTTCGT TGAACGTTGC AGAAGCCTCT CTTACCGTTT GCGATGAAGC TGGTATCCTT 3180 

TCGCTCGATG TGCCAGAGAT TTTAATTAAA CGCAGACTCT ATCGTTCCGG GGAAAGTGAG 3240 
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TACTTTCTTA ACGGGAATGC CGTCCGTCTA AAGGAGATCC GCGAGCTCTT TTGGGATACG 3 300 

GGAATAGGGA AGGTTGCGTA CTCCGTTATG GAGC a GGGGA AAATAGACCA GATTCTCTCA 3 3 60 

AATAAACCGG AGGAACGTCG CTACCTTTTT GAAGAAGCAG CaGGGGTGAC GCGCTTTAAA 3420 

GTTCGTGGCG CGGAAGCAGC a CGG AAATTG GAGAAAACGG CGGAGAATTT GCGTCATCTT 3480 

GAGGTTATTC TGCAAGAAGT AGAGAAGAGC TACGAGAGTT CAAAGCTCCA AGCTGCCCAG 3540 

ACGCAACGTT ACCGCATGCT CAAAGAGGAG ATTTTTGCGC GAGATCGCGA TCTTGGTCTG 3600 

TTGCGTCTGC GTGGGTTTTT AGAAAACCAA GCCCGAGCGG ATGGAGCACT CCAGCGCAaT 3 660 

cGCGCGGCGC GACGCGTTGC AAACACAGGT GGAGGAAGCA CAGCAGACGC TTTCTGCTCG 3720 

CATAGGCGAG ATCAATGATA TGGAAAAGcg CGTTGACGCG CTCCAAAAGG AAATCTATGG 3780 

CCTTGCAATT GAACAGAAAG CGAnCAAAAC GAGGCATCGC TACATCGTAA GCATCTTTCT 3840 

GAACTGAAAG AGTCGATTGG TCAGATAGAA ATGCGCAAGA TTGGTGTAGA AAGTCGCGTG 3900 

CAGAATTTGG AAGAAGAAGT AGCAGAGCAA GACGCACACG TGTATCAGTT AGGCAGTGCT 3960 

CTATCCTCTG TTGAAGAGCA TATTGAATCG TTTGCGCGGA CTTGCACGTT GCAAGTGAGC 4020 

ACGTCTCAGA GAATGATCAA ACGCTTCGCG ACATACAGGG ACAGATGCAA GAGATAAGTG 4080 

CCGCGTGTGT TGAACTTGAA GCGTCCCTAC GTGACGTGGC AGAAGATATT GCCGCAGAGC 4140 

TTGACACGCG CCTGAGTGCA GCCGGGTACT CTGCGCGCAA TCGGGCAGAG GCTGAGCGTA 4200 

CGTTGGTAGC GGGGGTACAG CGCCTGCGAA CCTTCGTGGA GGGGAGAGCA CGTATTGTTT 4260 

CAGACTTTCT GGTGGTAGAT ACCCACACTG AAGGGGAGCT GTGCCGGATG CTGACTACAG 4320 

TTGTGGACGC GTTCAATGAG GCGGTAAAGA TAGTGCACTG CGTTGAGTCA GACATAGCAG 4380 

AATATGCGCG TGTTTCTGCC CGGTTTATCG ATGAGTTTGT TGCTCCTCAG GGGATTATGA 4440 

CCAAGAAACG TGAATTTGAG CGACAGCTTG AACAGCACCG TGCACAGCTT GAGCGGCaTG 4500 

CTGCGCGTCA GCrCAaCTGn CAGGAAGAGA ACAAGCTCCT TGTTGGGAAG ATAGAAGCCT 4560 

GTCGCAAAAC GCTTGAATCC CTGCGTGTGG ATCAGGCGCG TCTGCGTGCT GAAGCTGAGG 4620 

CAGGACAAAA ACAGGCTGCA GGAACCAGAG GGGAGGTGGC ACGTCAGCGC GCAGTGATTA 4680 

AAGAGCTCGA AGGGGAGTTG TTTACCGAGG GGGAGCGGGT GGCG gCGCTC GAAGAGCGCT 4740 

TACTAGAGGT TGAAGGGGAA ATAGGACAGC TAGAACAGCG CGGTGTTTTG CTCACCAAAA 4800 

GTCTTGAGAA CTGCGAAGGA GAGATCCGTG TGCGGAATGC CGCAGTAACA TCTGAAGAAC 4860 

ATGCGCTCCA GGAAGCGCGC GTGGAACTTG CACAGGTGGG GCGGCAGCTT GAGCAGGCAC 492 0 

ATCGGGAGTT GATGCAGTGC GAAACTGAGA TTCGCAATTT ACGTGAACAT TTTCGAGAAC 4980 
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AGCACACCCG CGATCTGAGT GAGTTTGAGG ATTTAATACC GGGGaTTGAA AAAACGGCAA 504 0 

GTGATCTGCG CCaAGAGCGT GGGGAGCTTC aGGCTCGAGT GAAGGAAATC gGGGCgGTGA 5100 

ACTTTATGGC GGTGGAGGAG TTTCAGGAGG TAAAGGAGCG CTACGAGTTT CTCGTTGCGC 5160 

AGGTTGCGGA CCTTGAAAAG GCGCGCGCAG ATCTGCAGCG GGTAACCGAT AAAATTAAGG 5220 

CTGAATCTGC AGAACTTTTC TTGGCAACAT AC CGACGG AT TCGTAAGAAT TTTcACGAGG 5280 

TATTCCGTCG TCTGTTTGGG GGAGGTCGCG CAGAGATACG TCTTTCAGAT CCTGCAGCGG 5340 

TGCTCTCGTG TGGAATTGAA ATCCTCGCGC AGCCACCGGG GAAGAAGCTC GAGCATATTG 5400 

GCCTCCTTTC TGGTGGAGAA AAGGCAATGA CTGCAGTAGC GTTGCTCTTT GCAACGTATA 54 60 

TGGTGAAGCC TGcGCCGTTT TGTCTTTTGG ATGAAATCGA CGCAGCGTTG G ATGAG CAT A 5520 

ATGTAGCTCG TTTTGTTGGG ATGCTTGATG AGTTTTCTGA CGTCAGTCAA TATATCGTAA 5580 

TCACGCACAA TCGGCGGACG GTTTTGGGTG CACGCACCAT GCTTGGGGTA ACAATGGAAG 5640 

AGCCGGGGGT ATCGAAAGTG GTTTCGATTG CACTTGAATC TGCTTCTGAG CGACCGGCTA 5700 

ACGGCGAGGC AGGAGGAGCC ATTTGATGCG TCTGCGTGGG GTGGCAGGTG CCCTGTTGGG 5760 

TGCGGTAGTG CTTGTGGCGT TGGGGCTGAT GGGCGTCTGG TGGGTGTTCT ATCCAAAAAA 5820 

AGGGGACCGT GGGGCGGCTG TGGCTCGCGA GCCAGTGTTG TTGCACATAG ATCCTGCACA 5880 

GATGGAGGCA GCTGATGAAC CGTTGACGCT TCCCCCTATC GAGCGTTCCC GTGAGCGGAT 594 0 

GTCGGCGTGG AGTGAGCAGG AGTGCCTCCG ACAGCTTGAG TATCCGACGG AAAAGGCGGT 6000 

GCAGGCATTA GAGCACGCAA ACGAGAAACG TATACAGCAG ATGCTAGAGG CAGTACCGTG 6060 

AGTGTGTGGG TGGC GCTCGC CTTGCTGGGA ATGTGTGTTT CGTGTACGCA CGTGCCTCCG 6120 

CCTCGTGCCC TCATCGTTTC AAAGGAGCCG CCTCCAGCGT TGGATTCTGC GCCGCGCCCT 6180 

GCGATTCCAG AAGCAGTTCC TCTTCCGTCC CCTGTGGAGG AAGAAATCGC CGGTCGCCTC 6240 

CCTCCTGCAC CTGCCGCTGC ACCTGAGCGC GTTCCTGAGT CCTCACAGGA GCGGGAACAG 6300 

AAACCTGAGT CTTCGAAGCC TCAGGTGGTA GAGCCGGTGT CGCTTGCCTC TCCGGTGAAG 63 60 

CCTCGCGAGG CTGGGAGTGT Ac C TG ATGTT CTTCCAGTAC CTGAAGTGTC GTCGCCGCAC 6420 

GTTGCGCCGC CGGCACCCCC TGCGCCGAmA GCTCCCCGGC CGCATCGTCC CTCCCCTCCG 6480 

CCTGTATCGC CTTCTGCATC CAAACCAAAG CAGCGCGCTG TACCTCCTTC TCCGCCCCCT 6540 

GCATCAGAGC CTCCTCGTGA GGCGGAGGTG CAGGCTGAGC CTGAGCCGGC AGAGGATTCT 6600 

CCACGCGCGA TGGTGCCTGA AGAAnCGACT GGAGGCATGA nGnnCCGCGC GTTTCGCnCG 6660 

GATGACAGCT TGCATGGGGC AAAAACTTGA GGTTTTGTAT CCGGGGCGAA GTTGGGTGTA 6720 



Printed from Mimosa 02/03/22 07:25:49 Page: 510 



WO 98/59034 eno 

509 

AGTGGGCGAG CATACTGCGC ACCTGGTTTG CGCTATCACC AGnGCAATTG GAGGAGTCGC 6780 

ATTCGCTTTT TAACTTTATG CTGAGCGAGA GGGTGATTTT GTCTTAGnTT CTCCTAATTT 6840 

GATGnGTTTC GGGGTG 6856 
(2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10928 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

i 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 

CGCGTATGAA CGCAATGCCC AGGCGGTTAT TCCGTTGGAG CGTATCAGGC AG AC AATC CG 60 

TGCCGTTGAC GCGCGCGTGC AgtGCACTGG CTAGTTATTT TGAAAAGATA gGGGAAGAGA 120 

AGCGGcTACG GGTCCTTGCT CGTCTACTCG AACGCTATGC ACCGCTTATC GGCGAGCAAA 180 

AAATAACGGT ACgTTTCTTC GGTTATTGCG AGTCGCGGGT GCGTGATCTT CTCAATCAGG 24 0 

CGCTTCCACG TGCTGTCCTG CGTTCTCTCA CCCCCTTTGA TAAGGCTGAG GCCTGGCGCG 300 

CACAGTGCAG TGATGGGTTG ACTATTGAGA CGGAGGACGG GACGCTCCAG TGTCGGAGTA 360 

CAATCGAGGA GATCTGCGCG CAACTTTTGT cTGAAAAGAG ACAGGAGTTG GCGTGTGCCC 420 

TGTGCGGTAA TGGAGTGGTA GCGTGATCAA AGACGATGTG GTTACAGGCC GTGTAGTGAG 480 

GGTGTCTGGT CCCATTGTGT ATGCCGAGGG CCTCTCTGCG TGCAGCgTAT ACGATGTTGT 540 

CGACGTAGGg GAAGCATCGC TCATCGGAGA AATTATCCGG TTGGATGAGA GCAAGGCGgT 600 

CGTGCAAGTA TACGAGGATG ACACAGGTAT GCGAGTCGGG GAGAAGGTGA CAAGCTTGCG 660 

TCGACCACTC TCAGTCCGCT TAGGGCCTGG ATTAATCGGC ACCATTTATG ACGGTATTCA 720 

GCGCCCACTT GAGCGCCTCT TCCAAGAAGA CGGCGCCTTC TTGCGTCCTG GTGCGCGTTC 780 

ACAACCGCTT GATGGCTCCG TACGCTGGGA TTTTCGTCCT CATTGTAACG AGCGCGGTGA 840 

GGCCCTGTGC GCGGGGATTC CGATTGCACC TGGGTCaGTG TTAGGGACCG TGCAGGAGAC 900 

TCCTTCTGTT GTGCACACTA TCATGGTTCC TCCTGACATC CGGGGGAGCG TGCTATCTTC 960 

GTTCAAGGGC GCAGGTGCTT ACACAATAGA TGAAGAAATT GGACGCACTG ATCTTGGTGA 1020 

GCCGCTTTTT CTATCCCAGT ACTGGCCAGT GCGTCGTGCG CGTCCTTTCA GCAAAAAACT 1080 

TGCAGTGTGT GAGCCACTAG TTACTGGACA GCGGGCGATT GATGTTTTCT TCCCCCTATC 1140 

AAAGGGAGGA ACGGCGGCTA TTCCAGGGGG ATTTGGAACT GGGAAGACAA TGACGCAGCA 1200 
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TGCCGTTGCC AAGTGGTGTG ATGCAGATAT TATCGTGTAC ATCGGCTGCG GAGAGCGGGG 
CAACGAGATG ACAGACGTGC TCTCTGAATT TCCCAAACTC ATCGATCCGC GCACAGGACG 
CTCTCTTATG GAGCGGACGA TTTTGATCGC AAATACGTCC AATATGCCTG TGTCCGCACG 
CGAGGTGTCG CTGTATTCAG GGATTACCCT TGCGGAATAC TACCGTGATA TGGGTATGCa 
TGTGGCCATC ATGGCTGATT CTACCAGCCg CTGGGCGGAG GCGCTGCGTG AATTGTCTGG 
GCGCATGGAA GAAATGCcTG CGGAGGAGGG ATTCCCTGCG TACCTTCCGA CGCGTC TTGC 
AGAATTTTAT GAGCGCGCAG GACGCGTGGA AACCTGTGTG GCGCGCGAGG GCTCTGTGAG 
CATCATTGGT GCTG TTTCTC CCCTGGGTGG AGATTTCTCT G AGCCGGTG A CGCAGCACAC 
AAAGCGCTTC ATCCGTTGCT TTTGGGCCTT GGATCGTGAA CTTGCACACG CGCGTCATTA 
CCCTGCCATT GGGTGGATAG ATTCATACTC TGAATATGCG CAGGAAGTAA GTGCATGGTG 
GAGTAAGTAT GAcCCgCGCG CAGGCGTtGC GCGCCGCAGC CTTGGATTTG CTGAGAAAGG 
AACAGCgGTT ACAGCAAATT GTCaGGCTTG TCGGTCCtGA tGCGCTGCCt GGAGAAGATC 
GTCTGGTGCT AATGGTGTGT GAAATGATCA AAGGTGGCTT TCTGCAGCAG AACGCTTTTG 
ATCCGACGGA TGTGTTCTCC TGTCCCGAAA AGCAGGTGCA GATCTTGCGT ACCATAGTGG 
ATTTTCACGA ACGTGCCGTG GTGCTGCTGC GTGCAGGTAT TTCGCTTTCT GCGCTGTCCC 
AGCTTTCGTG CCGGGAGCTC ATCGTACGTA TGAAAAnTAC GTACGGGAAT GAGGATGTAC 
ACAAGATGCA GAAAGTGTAC GACACGATGT GCACTGAGTT TGACCAACTG AGTGTGTGTG 
CTGCCGCGCG CACACAAGGG GGGGAGAAAG TCGAATGAAG GGAGTGTGGT ATCGGGGTCT 
GTCCTCCATC GACGGTCCGA TCGTGGTGGC AAAGCGCCGG GAAGGTGCAT TCTATGGGGA 
GATTACGGCC ATCCGTGATC GCTTCGGTGC TCTGCGTACC GGCAGGATAA TTGATCTTTC 
TCAAGAGTGT TGTCTGATTC AGGTGTTTGG CTCCACGCTT GGGCTCAGCC TCGACGGTGC 
CTGCC t TG AG TTTTTGG AC g TGCCGATGCA GCTGCGTGTC TGTGAGGGTT TGATGGGGCG 
GGTATTCGAT GGATTAGGGA GACCAATCGA TGGTTTCCCA GAGGTGC TCT CTTCTCAATT 
GCGTAATGTG AACGGCTATC CTATCAATCC GTACGCGCGC GTATATCCAC GTGACTTCAT 
TCAAACCGGT ATTTCTGCTA TCGATGGTAT GAATACGCTC ATTCGTGGGC AGAAACTGCC 
AATCTTCTCT GGGAACGGCC TTGCGCACAA CCGTTTAGCA GCGCAGATTA TCAGACAGGC 
AAAAATTCTT GGCACGGATG AGGCCTTTGT GATGGTATTC GCGGGTATGG GTATTAAGCA 
CGATGTGGCC CGCTTTTTTG TTTCTTCTTT TGAAGAAACA GGGGTACTGT CAAAGGTGGT 
GATGTTCCtG TCGCTTGCAG ATGCGCCATC TATCGAGCGT ATTATCACAC CACGCTGTGC 
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ATTAACCGCA 


GCTGAGTATC 


TCGCCTTTGA 


AAAGAACAAG 


CATGTATTAG 


TCATTTTTAC 


3000 


AGACATGACA 


AACTACTGTG 


AGGCGCTGCG 


GGAAGTTTCC 


ACCACACGAG 


GGGAGGTACC 


3060 


CGGGCGTAAG 


GGTTATCCGG 


GTTACCTGTA 


TTCTGATTTG 


GCAGAACTGT 


ACGAGCGCGC 


3120 


AGGCAGAGTG 


AAAGGATCCT 


CCGGTTCGGT 


GACGCAGATT 


CCgAtCTTAA 


CTATGCCGAA 


3180 


CGACGATATT AGCCaTCCGA TCCCtGACCT 


GACCGGGTAC 


ATCACCGAAG 


GACAGATTGT 


3240 


GTTGCAACGC 


GACCTATCTC 


AGCGGGGCTT 


GTATCCGCCC 


ATTGGGTGTC 


TACCCAGCCT 


3300 


ATCTCGCTTA 


ATGAAAGATG 


GTATCGGGGA 


GGGTATGACA 


CGCGCAGATC 


ACCATGCGGT 


3360 


TTCAAGTCAG 


CTATTTGCTT 


CATACGCAAG 


AGTACAAAGC 


GTACGGAGCC 


TTGCCTCGAT 


3420 


TGTCGGAGAA 


GAGGAATTAC 


CTGCACTCGA 


TAAGTGTTAT 


CTGCGCTTTG 


GTGACTTGTT 


3480 


TGAGCAGTAC 


TTTCTCACGC 


AGgATGAGCA 


TGAAGATCGG 


AGTATCAGTC 


AGACGCTCGA 


3540 


TATCGGGTGG 


AGTTTGCTCT 


CACTTTTGCC 


GCGCACCGAG 


CTATATCGTA 


TCGACCCAAA 


3600 


GCTTATCGAT 


CAGTACCTGA 


CCGCTTCGTG 


CAGCGCGGTG 


AGTGATCAGT 


TGCGAAAGGC 


3660 


GATAGAGGAG 


GCCCGCACCC 


CGGTTGCGGA 


CGCGTAAAGA 


CCATGTGTCC 


TATAAGGCTC 


3720 


TTGGAGAAGG 


GTGATTTCTT 


TGCGGCGCTC 


CCTTGCTGTG 


TGTCTTGGCC 


ACGCAGGGAG 


3780 


AGGATACAGA 


GGTGAAAACA 


CCTTTAGCTC 


CCACCAAGTC 


GAATTTGGCG 


TATGTAAGAG 


3840 


ATCAGTTGGG 


TTTGGCTCGT 


GATGGTTATC 


GCTTGCTTGA 


GCAAAAACGA 


GAAATCCTCT 


3900 


TTATGGAGCT 


CACTTCTCTC 


TTGGAAGAGG 


TGCATCTTCT 


AGAGACTGAG 


CTTGATAAGC 


3960 


GTCGGAAGCA 


GGCGTATGCG 


TCGCTGTGGC 


AGCTGCTTCT 


TGC AC AGGGC 


CGCGATGATA 


4020 


TTGCTGCCTG 


TGCGCTCGTA ACACCgGTGC 


CCTGCCGTGT 


GCAGCAGGAG 


GTGCTTTTAA 


4080 


TTGCTGGATT 


GCGATTTCTC 


CGTCTGGATG 


CAGTGATGCA 


GCCACCGAAG 


CTGCAGTATG 


4140 


CTGCGCTCGG 


CTCCAGCGCG 


TGCATGGATA 


GAGCGCGGGA 


GGAC TTCGGG 


TTACTGTTGC 


4200 


AAACACTCAC 


GAGAATGGCA 


TCCGTACAGA 


CTATCGTATG 


GAGACTCGCG 


TC AG AAATG A 


4260 


GAAAAACACA 


GCGACGTGTG 


AATGCGCTGA 


GCAAGCAGAT 


AATCCCACAG 


ATGTGCGAGA 


4320 


CGTGCATGTA 


CATCGAAAGC 


GTGCTCGAGG 


AGCGCGATCG 


GGAAAGTACT 


TTTGTGCTCA 


4380 


AATCGCTAAA 


GGCGCGCAAG 


GATCCCACAA 


CCACCCTTTA 


GCACTCATCC 


GGCTGTACGT 


4440 


CCTGCGCTGC 


TGTTGTTCCG 


GGCCGACGCT 


ACCTCAGGGA 


GGCGCGTCCG 


ACACGCACTC 


4500 


TTCTTTTCCG 


CGGCCCTTTG 


CGTAGGtGCT 


CTTCTTCAGG 


AAAGcTGCGC 


GCGTGGGGGA 


4560 


CGTGCCGTGC 


TTCTCAGCGC 


GGCCCCTACG 


TTCTAGCAAA 


GCGGGaGCAA 


TGAGCTCAGC 


4620 


AATTTTTTTC 


GAAAGGGGAG 


AAACGGACAT 


TGCATACATA 


CGAGACGTGC 


GCACGATCTC 


4680 
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CCCGGCTACA ATAAACTTCG GCCGGTCTTT ATACACACAG GACCCAGGAT GAATGTGAAT 
ACACTCTGCA GTGAGCGAGC GATACGAATC GCGGTGTTCT TGCACGCACA CGAACTGTAT 
CATGCCCGTG CCCACGCAGA TTAAGTATTC TTCCACGGTC CCGCCCGTTA GCAGAGGGAA 
TCCCATGCCA CTTACGATAA GCTCGAGCTG CTCCTTTATG TTAGAAATCT CAGCCATAAT 
CCGCTCGTCC AAATAGAAAT GTTGGCAAAA GGAATGTTTG TTATTTTCtT GCgcATACGC 
GCGGTAGATC CGCAAAAAAG ACACGAAATC CCCCATGGGA TCGGAGAACG TGCCGTGTGC 
CTTTTTTGTT TTTTCTTCCT GATCCTCTGA AAAGATTAGC GGGCTGCGAG CAGACAGAAA 
CGCCGCAGCG ATAAGTACAT CGTCAATAGA ATGGGGATAG CGCCGCAGCG CCTCTACAAT 
CATCCGGGAC TGCCgAGGAC CGAGCGGAAA CAGGCACATC ATTTTTCCAA TTTCACTCAG 
ACTCCGGTCA TCTTCTAACG CGCCGAGCAA GCGCAACGTG TCTTCTGCGC CGATAATACC 
ATGGGTGCCA GGAGGAGAAA TAAAATCAAA GTGTTCGAAA TCG TGG AT AC CGAGTTCTGC 
CATGCGCATG ACTACCTCAG ATAGGTCAGT GCGGTAGATT TCTTCAAGGG TGTACGGTTC 
ACGCTGCTCA AAATCATCGC GC GAAT AT AG GCGATAGCAC GTGCCTGCGC GTACTCTGCC 
TGCGCGTCCA CGCCGCTGGT TACACGAAGC CTGAGAAATA GGAGTTTCGT CCAAACTTGC 
AGTATAGGAA AGCGGGTTAT ACGAATTTAA CTTCACCAAA CCAGAGTCAA TGACGGTAGT 
TACATCGTCA ATGGTGATGG ATGTTTCTGC AATATTCGTT GCGATGACGA CTTTTCTTTT 
TCCAAATGGC GCGCGGTTAA AAACTTGCTC TTGTTCTTCT TTACTCAATC TTCCATAGAG 
GGGCAAAAGA AAGAGCTTGC GGAACCAACG TTCATGGGAA AGACGGGTAA TACAATTTTT 
AATAGAACGC TCCCCTGGCA GAAAAATGAG TATGGCACCT TTGTCCCTTG AAGCGATAAC 
ACGCTCAACG ATACAAACGA TCTTTTCTAG CAAGGCGGCC TCCGCTTCCT TTGTATGAGT 
AGATGCaGGC GTATcAGGAG GATCGAAAAT AACAGTGACC GGGTATGCAA CCGCATCTAT 
TTTGATGAcA GGGCACTCAT TGAAATAGCG GGAAAACATG GCCGTGTTGA TTGTGGCAGA 
GGAGATGACG ATGCGGAAAT CATGCCGCTG TTGCAAGACG CGCTTAAGCA ATCCTAAAAT 
AAAATCAATG TTGAGACTCC GCTCATGCGC TTCATCTACC ATGATGATGG AGTATTTACT 
GAGGAGTGGG TCGAGCTTCA TTTCTTGCAG AAGGATTCCA TCAGTCATTA CTTTTATTTT 
TGTTTCGACA TCTGTGTGAT CCTCAAAGCG CATTTTGTAT CCGACAATGC CGGGCTGCAC 
GTGGAGCACC TGCTTGGCAA TGAACTCGCT TACAGAGAGG ACAGCAATTC TACGCGGCTG 
GGTGACGCCG ATAGCACCAC CTTCATTGTA TCCTGCTTCA TGAAGAATGA GTGGCAGCTG 
GGTAGTTTTC CCAGATCCGG TGGGGCTTTC GACAACAATG ACGTGATGGy kCGCGAcGCG 
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CTAAGAATTT TGTCTTTCTG AGAGTAGACG GGCAACTGCT TGTAACTGAA CATGATTGCA 
AGCTCCTCTT ACTGCGTGTG GATAGGmCAG GATAGAAAAA AGAACCAGAA GTGGGAGTGG 
TGCGAACGGG CGGTAGGGAG CGTCCGCACc GCACTGCGgG AcgGTGcTGA GAGTACAGAA 
AGACGGAGCG ACCAAGCGCT AGTCATTGAC ACGTTCTTGA TATtCATtCG TCTCTGTATT 
TATCAGGATC TTCTCTCCTT GCTTGATAAA TAGGGGAACG CGCACGACAA GACCCGTTTC 
GGTAGTCACA GGCTTTGTGG CGCCAGAGAC GGTATCCCCC TTAAGATACG GCTCGCTGTG 
TGCAACACGG AAAACCATTT TGGTGGGAAT TTTTATGTCA ATGGACTCCC CGTTCCAAAT 
TAGGATGTCG TATTCGTCCC CTTCGCGCAA GTAGCGCTCT CTTCCTGGGA CATTCCCTTT 
GGAAACGAAA ATCTGTTCAA AACTGCGGGT ATCCATAAAG ACGAAGCATT CCCCGTCATC 
GTACTGATAC TGAGCGCGGT GGCTGTCTAC AACCGCATCT TCGACTGTAT CTGAGGTCTT 
AACTGTCTGA GTGAGCACAG AGCCGTCACG AAGATGTTTC ATTTTAACGC GCGCAAACGC 
AGCACCCTTA CCCGGGTTTA CGAACTCGCG CTCGACAACC AGGTACGGAG CACCTTTATG 
GAGCAGGACC GTCCCCTTTG CGATATCTCC CCCTCTAATC ATGTAATTCC TCTCTTATCT 
CCTAGTAAAC GTCTTGCACG ACCTGCGGGG GCGCAGTATA CCGCGCAGtA TATTTTTTAA 
AAGGCCTCGA ATGGAGGCAT TGACTTTTCG TCCCTTGCCT GGATACTAGG CGCCCTATGG 
CGAAGAACAC TGATATTGAG CACGACGCGC ATGAGCCGGC CGGGCACGGG GATGTGCGTG 
AGTCTGCCGT GGAGAATCCG TCTGCTTCGG CAGTGTCTGA CGGGGAGGAG CGCGCCACGT 
TTGCGCCGGA GtTGCTCCGC AAACCGATAC CGAATCAGCG CAAGGTGCAG CACAGGAGTC 
AGAGCCAGAG GTACAGCGCG CAGGAGAAGC TGAAAAGGGT GTACCAGAGA AGGCTAAGGC 
AGTAGTGCCG CTTGATGAGT TGTTGCCGCA GAAGGTCCAC TTAATTCCGC TCACCGGACG 
GCCTATCTAC CCGGGTATTT TTACTCCGCT TCTGATAAGC GATGAGGACG ATGTGCGTTC 
GGTGGAAAGT GCGTACAGCG AT AGTGG TTT TATTGGGTTG TGTTTGGTGA AAACCGACAC 
GCAAAACCCA ACTATCAGTG ATTTGTACGA GGTAGGATCG GTCGCTCGTA TTGTGAAGAA 
GATTAATCTG CCAGACGGTG GGTTAAATGT TTTTATTTCT ACACAAAAAC GTTTTCGCAT 
CCGCAAGCAC GTGCACCACA GCAAGCCTAT CGTAGCGGCA GTGCAGTACC TGTCCGATCT 
TATTGAGGGG GATCCACTCG AGATAAAGGC ACTTG TGCGT GGCCTTATTG GGGAAATGAA 
GGAGCTTTCT GAGAACAATC CACTTTTCTC AGAAGAAATG CGGCTGAATA TGATCAACAT 
TGATCACCCC GGCAAAATCG CCGATTTCAT CGCGAGTATC CTGAATATTT CAAAAGAAGA 
GCAGCAACGC ACGCT AGAGA TTCTGGATGt GCGCAAGCGC ATGGAGGAAG TCTTTGTATA 



1/13041 
6480 
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TATCAAAAAA GAAAAAGACT TATTAGAAAT CCAGAGAAAA ATTCAAAATG ATTTGAACAG 8220 

TCGGGTGGAG AAAAACCAAC GCGAGTATTT TCTGCGTGAA GAGCTGCGTT CCATCAAGGA 8280 

AGAGCTGGGT CTTACCACCG ATCC AAAGG A GCGTGATCAG CGGAAGTTCC GTGCGCTAAT 8340 

AGATTCGTTT CACTTTGAAG GGGAAGTGAA AGAGGCTGTG GAGAGCGAAT TGGAAAAGCT 8400 

CTCCCTTACA GACCCGAATT CCCCTGAATA TTCaGTGGGT CGAACGTACC TC G AGACGGT 8460 

GCTCTCTTTa CCTTGGcACG CTCCTGAGAA GGAGGAATaT GACTTAAAGA AAGCTCAGAA 852 0 

ACTGCTTGAT GAAGACCATT ATGGACTCGA GAATGTCAAA GAACGGATCG TGGAGTATTT 8580 

GGCGGTGCGA AAGTTACGCG CCGATACCAA AGGCTCTATC ATCCTGCTGG TAGGTCCGCC 8640 

GGGTGTGGGA AAAACCTCGG TGGGCAAGTC GATAGCGCGC GCCATCCACA AGCCCTTCTT 8700 

CCGTTTCTCG GTTGGAGGGA TAAGCGATGA GGCCGAAATC AAGGGGCACA GACGTACTTA 8760 

TATCGGCGCC CTGCCGGGTA AGGTGCTACA GGGGCTGAAA ATAGTAAAAA CTAAGGCTCC 8 820 

CGTGTTTATG ATCGACGAGG TGGACAAGAT TGGTTCTGGC GCGCGCGGCG ATCCTGCGGG 8880 

GGCTCTGCTG GAGGTGCTTG ATCCGGAGCA GAACaCTACG TTCCGCGATC ATTACTTAGA 8940 

TTTGCCCTTT GATCTCTCTC ATATCGTGTT CGTGCTCACT GCCAATAGCA CCGATCCTAT 9000 

TCCCCGTCCA CTGCTGGATC GCGCTGAGAT TATCCGTCTT TCCGGTTATA TC GAT AC GG A 9060 

AAAGGTTGAG ATCGCAAAGC GCCATCTGGT GCCAAAAACG CTGGAGAAGA ATGGTTTAAA 9120 

GCGTGCGTGC GTCTCTTATC GGAAGGAGGT GTTGCTACAC CTGGTCCATT CTTATGCGCG 9180 

GGAGTCTGGG GTACGGGGGC TAGAAAAAAG CCTTGACAAG CTGCATCGCA AGCTTGCCAC 9240 

CGAGATCGTG TTAGGGAAGC GATCGTTTGA TGACAAGTGT TTGATGGATG AAGCTCTCAT 9300 

AGGGACCTTT TTAGGGAAGC CCGTGTTCCG CGATGATATG CTCAAAGACG CGAACAAAGT 93 60 

TGGTACTGCG GTGGGTTTAG CCTGGACTGG CATGGGGGGA GACACGCTCC TTGTTGAGGC 9420 

AATTAC TATA CCAGGAAAAG CAAGTTTTAA GCTCACTGGG CAGATGGGAG CGGTTATGAA 9480 

GGAATCCGCT TCTATTGCCT TGTCCcTGtG CGCCGTTACA GCGCGCAgCA GCGTATCnTT 9540 

CGCCGAATTG GTTTGAAAAG CGCGCAATAC ATCTGCATAT CCCCGAGGGC GCAACCCCAA 9600 

AGGACGGTCC GTCCGCGGGG ATTACCATGA CCACCACGCT CTTcTCGTTG CTCACCCAGC 9660 

AGAAAGTAAA GCCTCGCCTA GCGATGACTG GAGAACTCTC ACTG AC CGG A CAGGTGCTCC 9720 

CCATCGGGGG ATTGAAGGAA AAGACTATCG CsCACGGCGC GGTGGTATCA AGGAGATCAT 9780 

CATGCCAAAA GCGAATGTGC GGGATCTGGA CGAAATCCCC GAGCACGTCA AGAAGGGCAT 9840 

gTGTTCCACC TAGTTGAATC GATGGAAGAG GTCCTTTCTC TCGCCTTCCC CAAGGGGAAG 9900 
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CGTGTCCGTG CTGGCACTGC CGCCCAATCT GCTTCTCCTG AAAcCCTTAC 
ATGCGCTTTC GTGCACGCGT ATCTCAGTCA ACTGCGAAgT GcGTCGTGTT 
GGCACGGgAG GACACATTTT CCCGGGAATT GCAGTTTTTC AAGCgCTTGC 
cGGtGCGTGT CGTGTGGATT GGTGCAGCGC GTGGTGCTGA TCGCTCCATA 
CCGGATTAGA GTTTTGTGGT ATCACCGCTG GCAAGTGGCG TCGGTACGCG 
ATTTTTTTGA TGTATTTCGA GTGCTCGTCG GTACGGTGCA ATCCTATTGT 
CTTTGCnCCC GCAGGCACTA TTTTCTAAGG GAGGGTTTGT GTCCGTGCCG 
CAGCGTGGCT TTTGCGCATA CCCGTTGTCA CGCATGAATC GGATATCAGT 
CCACACGCAT CAATGCGCGT TTCGCCGATC GTATTTTAGT CTCTTATCCG 
GTTATTTTCC CCgTGcGCGA CGCgcAGCAG TTCACTGCAC GGGGAATCCT 
ATTTTTTTTC TGCACAGGCA GAGCGTGCAT ACCAGTTTTT ACGCATTGAC 
CATTGCTCAC AGTCCTCGGA GGAAGTAGCG GTGCGCGTGA CCTAAACGCG 
CATGTAGCAC CTTCCTTACC GAACGCTTCT ATCTTGTCCA TCAATTTGGC 
AGGACCAAAT GCATACTATC ACCAATTCGC TTAGcGTCAA TGCTCGGCAT 
CGTTTCCTTT CATTCAGGgC ACATCTGCCC GATATACTCG CCGCGAGCGC 
TCTCGTGgCT GGTGCGAACG CGGTGTGGGA GTGCGCATGC TCGGTAAACC 
TTTTCCTCTC GAACGAGGGA GTTCCCGTGG GGATCAGATT GAAAATGGCA 
GCGCACAC 

(2) INFORMATION FOR SEQ ID NO: 58: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3237 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

( D ) TOPOLOGY : 1 inear 



PCT^PS/13041 

AGGCTGACGT 9960 

C ACAGGAGGC 10020 

gCAcrGGCGG 10080 

GTGGAATCTG 10140 

AGTGTGCGCA 10200 

ATCTTGCGCG 10260 

CCGTGC ATCG 10320 

CCAGGACTTG 10380 

CACACGTCCT 10440 

GTGCGACAAG 10500 

CAAAAAAAGC 10560 

CGTGTTCTTT 10620 

GCAGgCAACG 10680 

GCCTACATGT 10740 

ACTGGTACTC 10800 

AATGGATTCT 10860 

GAATATTTT A 10920 
10928 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 

TACAACGCCG TGAGCAACCC TGCACTCAAA AAGATCAAAG CAATAACATG GGGAGGAGCG 60 

CCCATCGCTT TTAACATCGC AATTTCTTTA CGCCGTTCTG TCATTAGCAC CACCAGTACC 120 

GAAGAGATGT GCACTGACGC TACTAGCACT ATCAAATACA TAATGAATAA CAATAACTTT 180 

CGTGATGTTC GGAAAGAATG AAATTGCGAT CGATTCATGT CTTGCCACGT GTACGCACTA 240 

AAATGGCTCG GTAATTGTTC GTGTACTTGC TCAATAAAAC GAGTCATCGC CTTAGCGTCA 300 
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AACGCGTCGG CAGTTTTTAC CACAAAAGAA AGTAGCGCAG ATGCGGGGGA GAGAATTTTC 
ATTCCCAGCG TGAGGGGGAT AAATACCCAC AACGCATCAA GCTCCTGATA TCCGCAAGAA 
ACAATACCTC CTACCACCGC GCGCACCATT TTGGGGACCG CACGACCTGT CCCTCCTTGC 
ACGAGGGTGA GTATCTGGCA CGTGTCCCCA CAGCGCACCC CAATGCGCTC AGCGATGCGT 
TTTCCCAATA TTAACGTGTG TACTCCtGCG GCCTATCTAC CAGTTCAAGT GAACCTTCGA 
CGGTTAAAAA TGGACGAAGT CcACGcTCAC TAGAAAAAAA ATCAGGGGGA ACTGCGCGGA 
TATTCCCCCC TGCACGCCCT GTTTTTCCGA TTACAATACC ATCTCC CTGA AGGTGCATCC 
ATCGTGAGTG ACAGTATGGG CCAAAGTCcT GCGCCATAAA TGCATTAAAT ATGCgctGCG 
CGTCTTCATA TCGTTGCGTT GCCGTTTCAT TGGGAGCAAG CGGCAGTATA TCGATAAACT 
GGAGGTGACC CGATCCGAGT TCAATCATCC GTGTGGTGAT CCCTTCAATC ATTCCATCAG 
ACACCACAAG GACAACAATG AGTGGGATGA TGCTAATCCC GATGCCGAGC GCGGCACAGA 
AAAAACTTTT GCGCAAAAAG GAACGTCGTT TTCCTGCTAC CGGAGTACCT GATAGAAAAG 
GTACTGGcGT AGGCAGTACG TGGTGCGCAT CACCGTGTAG AGATGGGGTG TGCCCATATC 
CTGcGCACAC TCCTGCGCAA CGTAATGCAC ACATGAAAAT AACTCGAATC AGATTCACCG 
CGGCGCACCT TTGTTTCATA TGCGTATCAA ACTTCCCTGC TGTAGCTGGT AACGGTAtCG 
GTCATCGATG CaATACGTGG GTCGTGCGTT ACAATGAGTA ACGTCTTTTG ATATTCCTCT 
GTCAGAGAGA AC AG C AGATC CTGCACTATC AAAGCGTTCT TGGGATCCAA ATTGCCAGTC 
GGTTCGTCCG CAAGAATTAG GGTGGGATCA TTGATCAGTG CACGCGCAAC TGCTGTCCGC 
TGTCTTTCTC CTCCTGACAT TTGTGCAGGA AAATGATGGG CGCGCTGCAC TACGCGTACT 
TTTTCTAGCA ATTCGTATGC GCGTGCACGC ACcTCACGGT AACTTTTTCC TGCGATAAGT 
CCAGGCAACA TGACATTTTC AAGCGCAGTA AAATCCCTCA GTAGATGATG AAATTGAAAA 
ACTAATCCTA AAAACTGTCT GCGGTATTCT GTCAGTGCGT GCTCATGCAA AGTGAGTACG 
TCGCATGAAA GCACTCTGAC GATCCCCGAA TCAGCGTGTT CCATTCCTCC AATAATATTC 
AGTAAGGTAC TTTTACCGCA GCCGGATTCT CCGGTGATTG CAACCTTCAC TGCACGCGGC 
ACGc TAAATG AT AC G TC AG A CAAAATCTGT ATACGTTCTG TTGCGCAgCa GAAnGCTTTT 
ACTTACTTGT TCGACAGAAA GAATTGGGTC AtTCATCGCG TAGCACCTCA GCCgGCTTGA 
GCaGGAGTAT TTTACGCGTG GCAAGGTACG TTGCAACAGA CGCAGAACCT GTGCCAAACA 
GAAATACAAA CAGTACCTCC TGAAAGAAAA TCTGCACGGG AATACGCTCC ACGTTGTAAA 
AATATTGCGT ACCAAACACA CTGAAAGAAG GGGTTTTCGT TCCCGAGAAG AGGGAGAACA 



3 60 
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GGAAAAACGC AGAATTTACA GCAGTCTCAA TGCACGCAAT TATTTCGTTA ACGTGGATAG 2100 

TAATGAGCAA TCCCAGGAGT ACCCCCAAGA GAGAGCCCAA AAAGCCAATC ATAATGCCAT 2160 

TGCCGATGAA CAGAATCTGC ACGTGACTGA CAGGGGCGCC AAGTGAAACG AGCATAGCAA 2220 

TTTCTTCCTT TCGAGTGCGA AT AGAGC GGC GCATGCTGTG ATAAATGTTT ACGGTTACCA 2280 

CCwTAAAAtC AAA t GAC AAG AAGTATCATG ACGTTCTTCT CTATGCGGAG CGCACTAAAA 2340 

AAaGC a CGGT TGTACTCCCG c CAGGATTCT GCC t TGAGAT scAGGAATGT GTTGTGCaAG 2400 

AAAGAAAAGG TAGCGATCGT CTCGCTCATG GTTATTTAGT TTGACTGC CG CGGTAATATC 2460 

aGGCGTCGTA CCAAATAAAG TGGTGCC CAT GTCCAGAGGA ATGTACGCAA ACGTGGAATC 2520 

TACTTCGTGG TATCCCGATT TGAAAATGCC CGTTACCGTA AG TTT ATTCC AGCCTGGCAT 2580 

TATCTTTTGT GTATCACTTC CTGACAGGGC AAGCGTGTCA ACCTGATCTC CGGTACGTAC 2640 

CGAAAGGTGG CGCGCCAGTT CATATCCGAG CACAATGGAG TGC TTTTT AC TCAAATTAAA 2700 

ACTTCCGGAT GTTATCGGGA GTGCACGCGC CAGCAACCTA TCCCGATGGA AGATATCTGC 2760 

AGGAACTGCA CGCACAAGCG CACCGTGTTG CCGATAATAG TTGCCTTGCA ATAAGGCATG 2820 

CGCTTCTATA AATGGATAAA AGGATTGATA GcCGCCTAAC GTCTCTGCAC GTTTTAtGCG 2 880 

TCAACACTGC CATATACACG AACGTGTGCA GAACTC AC CT GTAAAATGGT GCCAATAAAA 2 940 

CCCTGCTGGA AGCCGTTCAT AACCGAAAGG ATGACAATTA AGGTAAGTGC CCCAAAGGCA 3000 

ATGCCTAATA TAAAAAAAAG ACTGGTAATC GCGTTyGCAC TCCGCGCGCG C AC TG AATTT 3060 

AATCTGCGCA CCATAAAACA CATCCACCGc AGCGTTTGCA CGTGGGTGTT ACTCATCGTG 312 0 

TACTTCCTTt GTAGAGTAAA TTTCCTTCCC ACGCTCAAAT ACCTGCACGT CCTGCGCACC 3180 

TTCCTTTTGG TGAATTATTG CCTGTAATAC nCCGTTGCGA TAACGTTTTT CTAACAC 3237 
(2) INFORMATION FOR SEQ ID NO: 59: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2582 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 

GTCGTATCCG nGnTAGTCCA CCGGTTCCTG AAAACACCTG CTGCGCTGCA CGGACACCAC 60 

CTTTCCCCCA AGTTCATCCA AAACAGGGTC GCAAACCGCT GCGAGTTGAA AAGCTCGACG 120 

CGTTGCTGCA CACGTAATCT CATAAATATT GCCCTGAAAC TCAATCAACT GCTGCATTGG 180 
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GAAAATCATG GGTCATCGTA ACACCTATCG CGCCGCGCAA CAAGCACTGC TTGACGTACA 240 

CGATCCCCCC TGTGTAGAAT GCGCACCGCG GGCAATGGCG CAGTAGGTTA GCGTACAAGT 300 

CTGGGGGACT TGGGGTCCCC GGTTCGAGTC CGGGTTGCCC GAGGAACAGC TGGCCAGCTC 360 

ACCCCGGTAG GCCGTTTGTG TCTTTGGGAA GCGGCGTGCC CGGCTGCGTC TCGCCTGTAG 420 

GTTGTAGCTT CCGTAGTGCG TTCTTAGGCA GGTGTGTAAG GAATGGCGGG GCTAGTAGGC 4 80 

ACTCAGGGTC TATCGGCGTG CTATTTATCC GTGCTTCCCA GTGTACGTGA GGCCCGGTGG 540 

AGAACCCGGT GGTTCCGCTG CGTGCAATGA GCGTGCCCGT TGACACGTAC GTATCTTTTT 600 

TCACCAGCAG CTCGTTCAGA TGGTAGTACG CTGTGTACAG CCCCGGGGCG TGCTCCAGTA 660 

CCACGCTCCA AmCCGTGGTT GTCCGTTGCT CTGCGAGTAA CmACCtCCCT GTACnTGcTG 720 

cATACAmCGC CGTTCCCACc GGAACTCCAA AGTCCTTCCC CCAGTGGTAC CTGnCAGAGC 780 

GCGTCCCGTC GGTGTACACA AAGACGCGCG CnTGCCCAAA CACGACGTAC ACCGTCGAGA 840 

TTCC AC CGGT TGTCGAAACG GCCCTAAAAA GGCCCGAGTC TGAGGGGAGn TCACGGTCTC 900 

nGGAnGCCTT ACAGGsGTCA CGCTGCACCT TTTTGCGCTC ACTCTTGTCC TGCGCAATGG 960 

CGGTATTCTT gCGATCTAAG CGTAATTCCT CACGGGGAAA TTCCTTTTTC TCAATGCGCA 1020 

GCGGCGCACG CCGCACATAT GGCTTCCCTC CCGGCACACG CACCTGCGCT TCAAGCATCC 1080 

AATCCCCCGG TTCCCAGAAA ATCGATATCC CCAGCAAGGC AACGTGCGTC ACATCCTGAG 114 0 

AACCTGCACG CGAGACGCCA GCGGTTGCCG CGTCCGTCCC TAACTGAGCA ATACCCTTTG 1200 

GGGGAAGCGC AAAAGCGCGC ACCGTCTTTG CTTCTTTACC CGCAGGGGTA CGCAGCACCA 1260 

GATGTACCTC AGTATGCGCC TTGTCCTTTT CTTGCAATGC CACTAAAGAA AAAGTGGCCA 1320 

TCGCACACGC ACCTTGGGAT ACCTGACGCG GGAACTGCAT AGCGATACGC TCGAAATGTG 1380 

CAGGAACCAC CTGACGCTCC GGCGGcGGCA CCGCAGcCGA ATGAAGCAGG AAGGACACCG 1440 

CGCTGACAAA GACACCAGAG AACAAGAGTA c TTCGC AC AG ACCACGCACC CAACGAACAC 1500 

TTCTTTTCAC CGACGGTGAC TGCACGCCCT GCGTCTGCAC TGCCCTTTTA GCGTTCACCC 1560 

CCGGTGCGCG TCCTACTCTC TCTGCACCCA TCACTCACTC CTCCACAAAT CTTGAATGAC 1620 

CAGCTGCGGT GTGCACGTTC CCTGAAACGT GTTACGCGTC ACTTGAAAAA CCGCGTCAAC 1680 

TACATCCCCC ACTGCAAACT CCTGTGCCAA CTTTTCCCCT GCTCCCCAGT AAATTGCGGG 1740 

CCATTTATGC ACCTGTGCAT CCAAGGTCAA TTTTACGTGC ACACGTTCTG TACGCCCAAA 1800 

AAGCGATGCA GAAAAAATTT TCAATCTCTT CGCCAAAAAG CACAACGGGG GATTGCCTTC 1860 

TCCGTACGGC TCAAAGCGAT CGACAAGGGT CAAAAGCCCC CGCGTCATCT GCGTAGCATg 192 0 
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cCAGTTCTGC ATCAAATTCT CCACACTCTT GCGCGCTTTC ATCAGCAAAC 
CCGCATACAG TTCCATACGG TGCAATAGCT GGGGAATTCG CTCAGAGGGA 
CCGCCGCAAA TGCATGCCCC CCATAGTCAG AGAACAAGTC TGCAAGGGGA 
AAAATAGGTG ATATCCCCGC GCCGAACGCA ACGATCCTAC CGCGTGCCCG 
TACAAATGAT CACACAAGGC ACGCGCAGCC TnCGcTCAAA CAGTTTGCAA 
AACGCCCCGA TGAATCTTAT CGCTACAAAC CACTGCCAGG CGGTTGCTGT 
ACTTGcACGC GCAAGAGGCT CAACAAGTGC ACGAGCACTC CTTCCTAACT 
TTCGTTCAAT TGCACCATTT TTCGTGCCTG cAGCGCGCGC TGCGAAgTTT 
AAACAGTTCC ACTGCACGGT GCGGACACCC TAACCGCCCC GTTGCATTGA 
AATACTCCAC CCTAnCTCTA CGGTTCCTAA CTTCTTCCCC ATGAGACGCT 
CAAcTcACGC AAACCCACAC GTGGACGGCC TCATTCATCG CCTGCAGACC 
AT 

(2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5504 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
<D> TOPOLOGY : linear 



PCT) 

TCAATGGTTG 
ATTGAAAAAC 
TCTAAGAGCG 
TCTGCCATTA 
GAATCCCCGT 
ACGTCTCAAG 
TTTTTCGCTG 
CGCGCATTAA 
TAAGCGGCAC 
GTATCGcAAA 
GTAGCGGACC 



1/1 3041 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2582 



(xi) SEQUENCE DESCRIPTION:" SEQ ID NO: 60: 

CAAACAGATC AGCGGGAGAA ACGTCTGGCA ATTnTAGCGC GCGCGCATCG GCAGGAAGAG 60 

ATGGCGGCAG CCCCGGTTTA TCTGCCACAA GCACACAGCG CACCAAGCCC CCGCGACGAG 12 0 

CAAGACAAAG CGTCTGCAAC TCATCGGGCA AGCAGCGGTA TGAACGCTCA GAAAGCTCCA 180 

CGGGGATAAG ATGTATCCCC CGCTGTGCGA GTGCCCTCAC ATCTGCCGGA TCGAAATCCC 240 

CCCACGGCTC ACAGCGTTCC AGATGCGCAA TGCACTGCGC AATGCGCTGA GCAAGTTCCA 300 

CGCGGTCTGA ATGAGTACGT ACGATCTGTT CCACGGCCTC TGCTGCTTCT ACCACCTGAC 360 

CTGCAACGCG ACACTCCTCT CCTCGGGTAA CATTCTTTGT CTGAGCGTCA GTGACGAGCG 420 

CAATGGCCTG CACGCACCGA GCGTCCAACG CGTgCAACTC TGCAAGTTGC TCACTTGCAC 480 

ACTCCCGCAA CTGCACATGC ACAGCACCAA AGGAACGCAA CGCCTGcAGC GAACGCTCTT 540 

GCTCAGAACC GAGCACCAGA AGCGTTACCT TTTTCATAGG AACTATCACC GTGTATCCTC 600 

TTGTTCCATC CGACTCACGT CTACCAGATT CTTCTTAGAC ATCTTCCCCC GCACTACTGC 660 
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AGCAACCTGC TGATCACCGA GGTACACCGT TATTTtCCGT ATGCATGCCC GCGTCTCAGG 
AATCTTAACT TTTTCAAAGA GGTTAACACG CTGCGTTGTA GTCCGCAACT CTGcACCCAG 
GAGAAGCGCC TGTTCGTCGA GAACATGCGC CTCCAAGTCT AAGCTTAGCA CTTCC TGC AT 
TTTGCGCACT GCAGTATCCA CCCACAGAGG AACACGATAT AAGTCATAGG GAGGACAAGC 
AAAGTGCACT TCTAAAAAGC AGGGAATACG CACACCTGCA ATGCTAGCAT ACGTTTTCTT 
TACCTCTTGC ACGC GGAGCA AACGCGCGTC GAACACACCG CTTTCAGAAA AAACTGCAAC 
CCACTGCTGA ACATCCTGAC GCAGGGCATC TGCACGGGAA CGTACTTC AG AAGCGCGCGC 
CTCAACGGCA CGGATCTCAG C AT AC AA C TG CTGCTTTTTA AGCTGAAGCG TAGGGAGAAA 
ACGGCGAAAC GTCTTGAGCG TCTCTTTTTG ACGTTTCAGT TCATTTTTGG TTAAGCGCAC 
CGCCAtCGGT CACCACCCTA CGCAGGCCAA TACGTGTTAA TCAAATCAGA GCGAATCCCC 
GTCTCCTCTG GGGTGAAACA CCGGCCCAGA ATTTTCCACC CCGTATCGAA CGCCTCTTCA 
AGCGGAATAT TCACCGAAAG ATCCATGAGc TGCGCTTCAA ACAGCCCACC GTATGTGAGC 
AGTTTCTCAT CCCACTCGCT CATGGCAAAA CCCATAGATC TTTTCTCAAG CGCATCACGA 
TAGGCGGCAT ACAACTTAAT CATATTATCC ATAAGCGCGC GATGATCTGC ACGCG T ACGC 
CCGTTTACGT TCTG CTTAAG ACGGGATAGA CTCCCGAAAG gTTCAATGCG CCCGTTCTtC 
AGATAAAACT GaCCCTCAGT AATGTACCCC GTGTTATCAG GAACCGGATG CGTAACATCA 
TCCCCTGGCA TGGTGGTAAC GGCAAGGATA GTCACTGACC CTGcATCATC AAAATC G AC C 
GCCTTTTCAT AGCGCGACGC AAGCTGGCTG TACAAGTCAc CCGGATACCC ACGATTCGAG 
GGAACTTGTT cCTGAaTAaT CGCAaTTTCC TTCATAGCAT CAGCAAAaTT AGTCATGTCG 
GTTAAGAGCA CCAACACATC CCTACCCTTC AAGGCAAACT GCTCGGCAAC TGCAAGaCAC 
ATATCAGGGA CCATCAAACA TTCTACGGTA GGATCTGAGG CAGTGTGCAC GAACAGGACT 
GCCCTACTCA ACGCTCCTGC CTCTTCCAAT GCACTTTTAA AATACAGGTA ATCGTCATGC 
TTCAGCCCCA TACCCCCGAG GACGATGACA TCAACCTCCG CTTGCATTGC AATACGGGCC 
AGCAGTTCGT TGTACGGTTC CCCTGAGCTA GAAAAAATAG GCAACTTCTG AGAAACAACC 
AGCGTATTAA ACACATCAAT CATGGGAATA CCCGTGCGAA TCATACGCCG CGCGATAACC 
CTCTTTGCCG GATTAACCGA AGGACCGCCA ATTTCCACCC TCCCTTCCTT TAAGGCCGGA 
CCACCGTCTC GGGGAACGCC AGAGCCATTA AAAATTCTCC CCAATAAATA ATCTGAGAAA 
CTCACGAGCA TACCCCTCCC CAGAAAGCGC ACCTCGCTCC CGGTGGAAAT ACCCCGGCCT 
CCCGCAAACA CCTGcAGGGA AACTACATCC CCTTCAAGCT TATTCACCTC AGCAAGCGAA 



720 
780 
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900 
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TCGCCAAACG CCGTTTTTAC CCGGGCCAAT TCCCCGTAAT 
GTGATGACAG AACCGTTGAT CGACTCAATC TTCTCGTACA 
ATCCCCCGTA TAATTCCCTC TGcTTCGCTG TCGATTTTCG 
GCACGTATCT CCTTTTCTTT CTCCACAAAC GCCTCAGAAT 
TCGATAAACA TATGCCCAAG CTTGCTGAAG TATGCCCGCG 
GCTAAAACAC TGCCAAGAAC CCGCATGACG ATGGCATAGC 
GGTACTGCAC TATCGACTGT GTCAAAAGAA TTCTGCTGCA 
GAGCCTTTCA GATATACGAG AAAGTCCTCC ATAC TTGTGC 
ATCATCTGCT CCACCTCTGC CCCACGGCGC AGAAAAGAGC 
GCGTCAAGCA CACTTGGATA CTTAGACCAT GAATCAAGCG 
CGCGCGTCAG AGCgyTnCTC GAGAAAGTCC GTGAAAgCCC 
TGCGTTACCG GTTCTTCGAA ATTACCACCT GcCGGAGAAA 
GATCCTTTCT CTCCACTCCT CAGCCGGACC ACACCAGCCC 
CAAGACTCCA GGTACGCAGG AAAGGCCTCC TCCCCCGGAA 
ATTTCACGCA GGGCCTGTGC CCAACGGCTC GTAGAtCCGC 
CCATCTGACG GTAATATTCT GCAAGCGTCA CTCCCGTGTA 
CAACGGGCAT AGAAGAAGTG TTGCACACTA TAACCGTCCG 
TGCGAGGATC CGTAAGATCA GGAAACTCCC GCAGgTtCTC 
CCCCACACGC AGCAATCACT ACCACGTCCA CATCCGCATT 
GCACCGTCTT TCCCGCACCA AAGGGACCGG GAATACAGTA 
GGAAAAAGGT ATCTATCGTC CTAATGCTCG TTACCAATGG 
CTGcGTAACA ATGGACGGGT CGCTTCACTG GCCAACGAAA 
CATGTCCCTG CGCGTCACGG ACCCGCGCAA TCACATCGTG 
TCTGAATGAA GACAACCTCA TAGGAATCCC CCATATGAAA 
TGAGCGCACC CTCTGGGGTA TACCCGAGCA CGTCCCCACG 
AAACATGCGG GGTAAACATC CATTCACTTG TCCGAGAGAG 
GCTCCAAGAA ATACCCAACC TTTTCTGCAA GCAGCGGCAA 
ACACCTGACC GAGCAAACCA GGACCTAGCT CAACAGACAG 
CACGGTCCCC AACGGAAACC CCTCTTGTGA TCTCAAACAC 




13041 



GCACCCCCTT TGCCCGCACC 2460 

CCTTGTACAT CGTCTACTCC 252 0 

TCGATTCTCC CTGGAGAAAG 2580 

TCCAGGCGCA ACAGTTGTAA 2640 

CGTCATCTTT TGATTCAAAC 2700 

AGTGCTTTTG ACGTGCAACA 27 60 

GATACACCGA ATCAAGAAAC 2820 

CCTCTTCGCC GACGACCCTC 2880 

GACCGTACGC AACAGCCCGC 2940 

GATGCACCGC AGgATACCTG 3000 

CAACCACTTT CAATGTAGCC 3060 

CCGTCCCTCC AATAGTTACC 312 0 

GCTCATAAAA GGCTGCGATA 3180 

TCTCTTCCAA ACGCCCAGAC 3240 

CAGCAAAAGA ACATCCAACC 3300 

CACTGAAGCC TCACGAGAGG 33 60 

CTCCATAAGC GACCGACCAG 3420 

AACCACCTCC CCTGCACGCT 3480 

GCGACTGGTA GAATGCTGCA 3540 

CGTCCCCCCC TTGGCCACCG 3 600 

CTCAGTCGGT TTCAAAcGCT 3660 

TGCCATGGTC AGTTCGTGCT 3720 

CACGCGGTAC GTCCCTGCAG 3780 

GGGAACCATA ATGCGGTGTT 3840 

CACCACGCGC TCACCCACTG 3900 

GGCGGGCAAA TACACCCCGC 3960 

CGGATTCTGT AAACCGTCGT 4020 

CAAATCGCCT GTAAACTCAA 4080 

TTGCAACTGT GCCTCACGAC 4140 
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CACGAACACG AATAATCTCC GCTTTCAAAC GCGCATTTCC AACATGCACG TATCCGACCT 4200 

CGTTGAGCGA AACGACACCC TCGAACGTAA CGCTCACCAT ATTGCCGTTG ACCGCAGACA 4260 

CGATACCcTT CGTTTGCGTC ATGTATATGC TCCAAGAATT GTATAATAAA GAGCTGCGTA 4320 

TGCTTTGGAC CCCGCCTGCA TTGTGAAACG CGACCGATGC GTAAGCAACA TCAGCTGCAG 4380 

CCCGTACAAA AACACTGCCT CTGAGGAAAA CGGGTCAAGC GGCCTACACC CCTCGACAAA 4440 

GGAAAAGCGC GCGTCGTTCA AAAAATACTC CGCTTCGAGC GGATCGTCCA AAGAGACCGC 4500 

AACACGAGCC GCACGAGCCA CCGACTCCTG CGCCACAGGA CACTGCCTTA CTTCAACAGG 4560 

AGTATCCCAC CGCAGACGGT cCgcGCGCTC ACgcgCAAGc GCACAGCGTA ACGCGTACTC 4 620 

AAACTCGCCC CATCTATCTA AAACACGCGA TCCCGTGGAG TCACGCGCCG GCACGGGACA 4680 

CAACGAGATA TTTCCAAGCA CCGCAGCATC CTGCCGACCC AAGAAACGTA GCGCACAATC 4740 

CAAAAAATCC TGATAACGCA AAGGAGGCAC CGCGCCGCAT AGAAGAGATG GTAGCTGCGT 4800 

TATAAGGTAG CAATAGGAAG ACATCACAGC TCcTGCGCAG CAGmCTGAGC ACCTCGGCAA 4 860 

CACGCGCAGA AACATACGAA GAGAACAACT GAGCAACgCG GCGGCGGAAA AAtcATAGTa 4920 

CGACCCGCCC TCGGCAGGGA CTATCCTAAA CCCTGCCGTA AGGCAATCGT CAGACCTCAA 4980 

CTCTACCCCT GCCGAAAGCT GCTCCTGCAA CGCAsCACAA AACACCCCCT CAAGCGTCCG 5040 

AAGATCAGCA GGAGAGAGGA TGAGCTCTAG CTTATCACCC TCCGCTTGAA CCCAGGCAGA 5100 

AACGACACGA GGAATAAGtC ACGCAAAACA CCCGCATCGT aG t TGCGCCG TCTCCATCGA 5160 

AATAATAGCC CGAAGAGAGC GAGTCACCGA ATCTTGAAAG GATAATAAAA CGTTGCGACT 5220 

CGCCTGCGAC AGGCCGCAAG AGACGACGAC TCGATCCGCT CTGCCTCCTC ACGCGCAGCG 5280 

GAACAATCCG CTCTGCCTCC TCACGCGCAG CGGAACAATC CGCTCTGCCT CCTCACGCGA 5340 

CTCACCAAGC AAACGAGACG CCTGCTCCTC GGAAGAGGnC AAnCGCTTCG CGCTTAATTC 5400 

GGTCATCAGA TCTTGCAGTT GAATCTCCAC TTATAGTTCT CCTCGCAGCC CTCCGAGTAT 5460 

ACTAAAAAGT CCCCACCGGG AnAAGGCATA ACACAnTTCG ACCA 5504 
(2) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8467 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 



CT^P/13041 



Printed from Mimosa 02/03/22 07:26:01 Page: 524 



WO 98/59034 




523 




PCT/O^ 


IF13041 


TTGTATTAAC 


CCATTGCCTT 


ATCCTTTTTC 


ACCCAGCGCC 


AGTTCACGAG 


ATGCATTACG 


60 


TTCCTCCCTT 


GGAAAACGGA 


GAATGACTTC 


CGTTATATCC 


GCCCGTTCTC 


TAGGGTGGAG 


120 


ACAAATCCAT 


AAAAGTAACG 


CCTCTTTTTT 


ACTCCCCCAT 


ACCTCATCAC 


CGCATACAAA 


180 


GCAAAACAAA 


ATCACTGAGG 


TTAAACATAC 


CCACCGTGTT 


ACGCTGTACG 


CGAATCCACA 


240 


GATCGCATAA 


CCCTCACCGT 


TTTCTCCGAT 


AAAGAATCTG 


CATCACCACA 


AACAGCATTC 


300 


CTATAGCATA 


CACTATTCTC 


AG AATGAC TT 


GGTTGAGTAC 


TCACCAGTCA 


CAGAAAAGCA 


360 


TCTTACGGAT 


GGCATGACAG 


TAAGAGAATT 


ATGCAGTGCT 


GCCATAACCA 


TGAGTGATAA 


420 


CACTGCGGCC 


AACTTACTTC 


TGACAACGAT 


CGGAGGACCG 


AAGaGCTAAC 


CGCTTTTTTG 


480 


CACAACATGG 


GGGATCATGT 


AACTCGCCTT 


GATCGTTGGG 


AACCGGAGCT 


GAATGAAGCC 


540 


ATACCAAACG 


ACGAGCGTGA 


CACCACGATG 


CCTGTAGCAA 


TGGCAACAAC 


GTTGCGCAAA 


600 


CTATTAACTG 


GCGAACTACT 


TACTCTAGCT 


TCCCGGCAAC 


AATTAATAGA 


CTGGATGGAG 


660 


GCGGATAAAG 


TTGCAGGACC 


ACTTCTGCGC 


TCGGCCCTTC 


C GT ATT TG TT 


CCTTACCAGG 


720 


ATGCGTACTC 


CCCTTCGTAC 


AGCGCCGCTT 


CTCTTGC TGC 


TCCTATGCGC 


ACTTCCCCGG 


780 


GCGTTGTGTT 


GCTCTCTAGT 


GCAc TGCGCG 


GGGTACCATT 


CGATGTACCG 


ACCCCATACG 


840 


TTTCCCGTCG 


GGCGAACACT 


ATCGACGCTG 


CCACCTTTGA 


AGACGCTCAT 


GTACCTGCAT 


900 


TATTTC CCGC 


GCTCTTTGCG 


CTTTGCAGGC 


ACGCGCCCaC 


ATTCGTGTAC 


GCAGAAAGTG 


960 


CCCATGAGGT 


GATGCTCAGC 


CGTTTTCTGC 


AAC AACAGC C 


ACATGCATGC 


GCCGGTGTCT 


1020 


TTTTTGTCCT 


TCCTGACTCT 


GCAGCGCGCG 


GACCACACCA 


TGCTCCTGCC 


GTGCAGGGCG 


1080 


CACCTCCCCC 


CGTCGACACA 


GCGGGCGTTG 


CGTCTGCTGT 


CCGTGGCGCC 


AGCCGGACAC 


1140 


TACCAGCTGT 


GTATCGACAG 


TATGTTCACG 


CAGCAGAGGC 


AGCGTGGGCA 


GAGCTCGCAT 


1200 


CCACCGATAT 


ACTGGCCGCT 


TAC t TGCAGG 


GCTCCCTCGG 


GACCGCCACA 


GAACGCGCCT 


1260 


TCAGGCACGT 


Gc t ACGCAGG 


TAGACCAGTG 


GATACGCGCC 


CAGCTGCATC 


TATCAGAACC 


1320 


TGTCCTTCCG 


CACGCGCAAG 


CGCTATCTCA 


TCACACTGTC 


CATGCGGGAG 


GGACCTATGA 


1380 


CCGCACGTGA 


GTTAGATGCG 


TACTTTCGTA 


GTTTTTTGAA 


CTTTGGACCG 


TTCGTCTCCT 


1440 


GTGATGTCGC 


TCTCAACGGC 


CTGCAGGTAG 


CAAATAGCGG 


TGCCCCCGTG 


CACAAGGTTG 


1500 


CCTTTGCAGT 


GGATGCGTGT 


GCACAGTCTA 


TCGACGCAGC 


CGCCCGCGCC 


GGTGCACGCA 


1560 


TGCTCTTTGT 


CCATCACGGT 


CTTTTTTGGG 


GACGCATAGA 


GCCGCTTACC 


GGTATGCAAT 


1620 


ACCGACGCGT 


ACAGGCGCTC 


CTGACGCACG 


ACATAGCGCT 


GTACGCAtGC 


ACCTACCACT 


1680 


CGATGCACAC 


CCGCAGTACG 


GTAACAATGC 


GGGCCTTGcT 


GCGCGAGTCG 


GTCTTAGGCA 


1740 
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AGGTGGTCCT TTCGGTTTTA TCCGTGGAAC TGCCGTAGaC TCTGGGGGAC GGTGGCAGAA 1800 

AACACCACCC CCTCTCAGGA GGCAATGCAG CAGCATGCAG CGTGCACAGC ACCCGATACC 1860 

CACCGCGTGA CGCATGCGAA TGCAATATCG CCGAGTGCCG GGCTATCTCT CCAACAAGTA 1920 

GTACATCGCC TCTTCCCCGC AGAAGAGCAA CCCGTGCGCC TGTTACCGTT TGGGAAACAG 1980 

CGTATCGAGC GCGTGGGTAT AcTGTCGGGC AAAGCAGGCA CGTAC CTTGC GGAGGCTATC 2040 

GCGTTAGATC TGGACCTGTT TATTACCGGG GAGATTGAAC ATTCTTGCTA TCACACCGCG 2100 

CGCGAGCACT CTATCTCGGT AATCGCAGGG GGACACTACC . AAAC AGAAAC CGTAGGtTGC 2160 

AGCTGGTGGC GCGCAAcTGC AACGGGATAC AGGCATAGAA ACGCTTTTTC TAGACATTCC 2220 

CACGGGGATG TGATACGCTC GCGCCCGTTA AGGGTGGATA CAATGAAACT CACACGGATA 2280 

CAGAAAGAAA AGTGGATCCC GCTTTTTGCC GCTGGATTAG TTGTTGTTCT GGATCAGTGC 2340 

GCTAAATTGT TGGTGGGTGC TTATGTGCCT ACAAACACCT CGGGCGTTCG CGTGCTCGGT 2400 

GATTTCGTGA GAATTGTTCA CGTGTACAAT GTTGGCGCCg CTTTCAGCAT TGGCCATCAG 2460 

CTAAATCAGG TTCTGCGTAC GCTCGTGCTC GGTATCGTGC CGCTAATCAT TATGTTCCTT 2520 

ATTGTTTTCT CCTATTTTCG CACTGACGCC TTCTGTCCTG TTCAGCGCTG GGCCGTGTCA 2580 

GGGATT ATCG GGGGAGGGAT AGGGAACTTA ATCGATCGCT TCCTGAGGCC AAACGGGGTG 2640 

CTCGACTTTA TCGACGTAAA GTTCTTTGGC ATCTTTGGCT TTGAGCGCTG GCCCGCTTTT 2700 

AACATTGCAG ATGCGGTCAT CATGACCTGT GGTTTGCTCT TGATCATTTC GTTCATAAAA 27 60 

CAAGAAAAAG AGATCAGCTC CCAACCCTCC TGCAATGAGA CGGGGGGCGT TTTTCGC AC G 282 0 

TAGAGCTGGG CCGTGCGCGC ATGTCCGCGT CGGCCGTTCT AGTTCGCGTG CCCCTGTGCC 2880 

CGCAATGGTT GCTTTGTTCT CCGCAAATAC CGCGCGTGTG TGCCGCGCGT TGCgcTtCCG 2940 

GCGTACCAGG GCGGTACgcG CGAGsgcCTC ACAGCACTCA GGATATTAGC CCATGCAGAT 3000 

CTTCGATACT CACGCCCACA TCGGTCTTAT TCACCCAGAT CCCGTAGAGC GGCTGCGgGT 3060 

AGTACAAGAG GCACGACGAG CTTCTGTCAC CCGCATCATG AGTATTTGCA ACAGCCTTCA 3120 

TGACTTTGCC GCCGTATACG AGACGCTCCA GTTCTCACCC TCTGTCTATC ACGCCGTAGT 3180 

GTCTCCCCTT CTGAGGTCAT GGCCCCGGGG AAGGATTGGA TAGATACTAT TCAAAAAAGC 3240 

CTACAACTCC CTCAGGTAGT TGCCTTAGGC GAGACCGGAT TGGACTACTG TAAAAAGTAC 3300 

GGTGATAAAC GCTCCCAGAT TGGGCTTTTT ATCACTCAAT TGGATATTGC TTCAAAGGCA 3360 

AAAAAACCAG TTATCATCCA CAACCGTGGT GCGGGCCAGG ATATCCTGGA CATCCTCAGC 3420 

GAGCGCATTC CCGACCAAGG CGGTGTGTTC CACTGTTATT CTGAGGACGC AGAGTACGCA 3480 
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CGTATGGCGC TGGATTTACC TGTGTACTTT TCTTTCGCGG GGAATTTAAC TTACCGGAAT 
GCACGAAATC TCCATGAGAC TGTATTGGCC CTCCCGCTTG ACCGAATTCT AGTGGAATCC 
GAAAGCCCGT TTATGTCCCC CGCCACGTAC CGCAACAAGC GCAACCGACC GGCGCACACA 
GTTGAAACCG TGGAGTTCAT GGCTGAGCTC CTTGATATGG ACATGCTTGA GCTTGCCGAC 
CAGCTGTGGA AAAACAGCTG TGCGTGTTTT CACCTTCCTG AGTGAGCAGC AGATGCAACA 
ACACGCCTTA TATCATCCGG TTTCTATTGG CCCGTTGTCT CTCAAGGGGA ATGTGTTTTT 
TGCTCCCGTT GCAGGCTATT CTGACAGTGC GTTTCGTTCA ATTGCCATTG AATGGGAAGC 
AAGCTTCACC T AC AC CG AAA TGGTTTCGTC TGAGGCGATG GTGCGCGATT CACTCAATAC 
CAAACGTTTG ATTCGGCGCG CGTCAAATGA GACGCATTAC GCTATCCAGA TTTTTGGTTC 
TAATCCTGCA GTAATGGCAG AGACGGCAAA ACTAATCGTC GATAGCGCGC AGCCGTCCTG 
TATCGACATC AACGCGGGAT GTCCTATGCC TAAAATCACT AAAACAGGAG CCGGAGCCGC 
ACTCACCCGA GAACCGACGC GCCTC TATGA AGTGGTAAAG GCGGTCGCCG ATGCTGTGTa 
CgcGCAAGAC GCGCGTATCC CAGTGACAGT AAAAATTCGT GCTGGGTGGG AAGAGGCACA 
CCTGACATGG AAGGAAnsTG CGCGTGCGGC AGTAGACGCA GGAGCACAAG CGCTTGCGTT 
GCACCCgCGC ACCTGCGCGC AGTGTTACGC GGGAGAGGCA AACTGGG AC A TAATCGCAGA 
CCTCGTGCAG TGCGCGCGTG GGTGGGGAGA GGTTCCCGTG TTCGGCTCAG GGGATCTGCA 
TGCGCCTGAA GACGCACGGG CAATGTTAGA ACACACCGCA TGCGCGGGGG TTATGTTTGC 
CCGCGGTGCT ATGGGCAACC CGTTTATTTT CAGACAAACC CGTCAGCTTT TAACTGAAGG 
ATACTACACG CCCGTGACGT TTGAGCAAAA GcTACGCGCA GCCTGGCGCG AGCTTCACCT 
TCTGGCACAA GACGTGGGAG AAAGCTCAGC CTGCAAGCAG ATGCGCAAGC GTTTTGTTTC 
GTATGCAAAG GGTGAGCGGG GTAAAACGCA ATGGTGTCAG CGCGCGGTGC ATGCGTCTTC 
CTTCGCAGAC TTTGCAGCAG TCATTCGTGA CGCGTGTCCA TGTATTGGTT TATAAGTTGC 
ACGGCTTTTC AAACCGCGTG AAAAACGTAC GCTTCCGGCG TACCCCAACT TACTTTGTCC 
TACAGGACGC GCAGnTCCCT CGATAGAAAG CGTGACTATA TCTGTCC TGC GTGCAACTTG 
TACAAAGCCG GTTCCTGTGC CAGAATCGTG CATCCGCGGT GGCAGGGAAT CCTGGTGAAA 
GAATGTGTTT CTAAAGAAAA GCGAATCTAT GACCTCAAGC AGCTCCTAGA GATTTCTAAG 
AGTTTGAATT CTCTCCTTGA GTTTACTCAC CTGGTAGAAG CCATCCTCTA CGTCGCGATG 
GCCCAGACCA AGACGCTGGG GGCAGCGCTT TTCACCAAGA AAAACGCCGG TATGAAAAAA 
TTGTCTTTGA GCCGCAaTGT GTGCGGCTTT GACGTTTCCC ACCATGCACA GCTGATAATC 



3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
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4680 
4740 
4800 
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4920 
4980 
5040 
5100 
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TCGGAAGAGG ACCCTATTCT CAGACTTCTG GACGAAAAGG CCTGTTGTCT TTCTCCCgAA 
gAgGTACAGA GCGCGCTCGC CCCCTCAAAG AGCGTACGTT CGCTCCyTGA CTTGCAACCT 
TCGCTCTTTG TTCCACTAAG AGCAAAGGAC CACCTTGTTG GTCTTATCCT TTTAGGCAAG 
AAAAtCAACG TA CACGAAGC CTACACTCCC TACGATCAGA GCATCATCAT GGATATTGCA 
CAGCTTGCTG CTATTGCCAT CAACAATGCG TTACTGCTTG AGCAAGCTAC CACTGACATG 
ATGACCCAGA TGAAGCTCAA ACACTACTTC TTTGCCATGC TCACCGCGAr CTC GAT AC AC 
TCAGTACACA AGAGACCGTA TCTGTTCTCA TGCTTGATAT CGACTTTTTC AAACAGATCA 
ACGACACGCA CGGTCATCTG TGTGGCGATC TAGTTCTCCA ACATGTGGCA GAAATTATTC 
GATCCTGCAC CCGTCCATGC GACATCGCCT CTCGCTATGG GGGAGAAGAA TTTATGCTCA 
TGCTATCCAA CAACTCGTCT CGGGaAGctG CGCACGTTGC AGAAmgCATT CGCGTGGCAA 
CCGAGCAATT GACCATCCCC TACCATGAGG TATCAATTCG AGTCACTGTT TCTGCAGGCG 
TCGCAGAATA CCTTCCTAAC CAAGAATCCG CCGAAACACT GATAAAGCGT GCAGACAGTG 
CGCTGTATCA AGCCAAACAA AATGGCAGAA ACAAAGTCGT CATCTCAGAG AAAAACATGT 
GCTCATCTCA GGAATAAACC GATACTGGCG GCATGAGTGT GATCAGGAAG CCCTTCAGGT 
ACTCGTACAC CAATGTGACC CTTTCCCTTG TGCTCGCGAA TGGGGCGGTG TTTGTGATCA 
CGTCGTTGGT TGAATCACTG GGTATATATC TGGCGCTCGT GCCAGGACTC GTACGTTACC 
ACCGTATGTA TTGGCAAATA TTCACCTATC AGTTCGTACA CAGCGGCGTG TGGCACTTGC 
TTTTTAACAT GCTAGGACTA GTGTTTTTCG GGCAGACGAT AGAAAAGAAG ATGGGATCTT 
CTGAAATGCT G TTG TTTTAT TTGCTTGTCG GTACACTCTG TGGTGCGGGT GCGTGCGCGG 
CATATCTGTG TGTCGGTCGG TTGAACGTAC TGCTGTTGGG GGCGTCGGGC TCCATCTTCG 
CAATACTTTT TTTATTTTCG GTTATGTTCC CCCACTGCGC TCATTTATCT ATGGGGTGTT 
ATTCCTATCC CCGCTCCTCT GCTCATTGTA GGATACATTT TGTTTGAAAT TTTTGATCTA 
TTTTTCTCTC GTGATAATGT TTCTCATCTT ACCCACTTGC TCGGTG TCCT TTTTGCGTGG 
GGATATATCC GTATCCGGTT TGGCATCAAA CCATTGAAAG TGTGGAGCAT TGTCCCGTAA 
CAGTCGAGGC AGTGGGAGAT ATGTCTTCGT CGTGCTAGCC TGCGTATTTG GTTATACGCG 
CGCCGTGCAC GCTGAGGTTT ATACGGACCC CAGCACATCG GGACATGTCA CGATTTCTAT 
TCCCATATGG GCTTyTGTCG AGCCCCAGCC GGGTGTCATG ACCCAGcAGs GGAGTCCCCG 
AGGACTCCGC CTCnCCAGAC CTTGCGAGAA TTAGGGGCGT TCGTATTAGG CGGTGCTGTG 
TATGGGTGGC GGTTCTCTTA TACGCCaAAA GAAAAGAAGC GCGCCGTCAT GGAGCACTTT 



5280 

5340 

5400 
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5580 

5640 

5700 

5760 

5820 

5880 
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6000 

6060 

6120 

6180 

6240 

6300 

6360 

6420 
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ACCCTCACTC CCATTTTCCC CCTACCGCCC GATAGTCCTC AGATAAGTCT GCGTCACGTA 
CGGACGCCGT ACCCCTACAT CCAtGCCGTG CAGAGTACTC ATTAGACGCC AGGCACGCGA 
CACACATGAG ACAGAGCAGA AACCTAACGT ACCAACGTGC GCAGGGCAGA GGAAGAGGAG 
AACGGAAAGA GGAACTAAAG GGAGTATATC ATGCATATCA CCGCGCGATT GTAGACGCAC 
TACGGAAAAC GGTTAGAAAG ACACAGAAAA ACAAGCCAAA AGAAGTAGAA GGAATGCTAT 
ACGTTAAAGA CAATCCCCGC CTCTTTGTAG AGGCGGGGGA ATTTGTCGCA GAGCTCTCAC 
TCAGTGTCCA CTTCACAAAG ATAACGC CCT ATAGCGTATA CTAGTAGCAC GCACCGAGTC 
CTGACCGCTA CCCGCGTGCG AGCAGACGGT TCACCCGCTT CACAAAATCA ACCGACGAAC 
CTACGTCCAT GCCTTCAATG AGCAAGGCTT GATCCAGAAG AACAAACGCA AGATCTTCCA 
CAAACGCCTC ATCCGTACTT TCTTTTAGTT TTTGTACCAG CGTATGACTT GCGTTAATTT 
CTAAAATTGG CTTTATCTTT GATTTATGCG TTTGTCCCGT GGCGCGCATC AAGCGCTCCA 
TCTGCACCGT GGGATCATTC TCATCGATAA CAATGcAAGA CACCGAGTCA GAAAGCCGTT 
TTGAAAGACG AACTTCCTTC ACCGAATCAG ACAGTATGTG CGTCAACCTT TCTAGTAGCG 
GCTTAAAACC CTGTTCCCTC TGCGCGGCGG CGTCTGTTTC TTCGTTGGGA CGCAACTCCT 
CCTCTGAACC TAAACGATTA ATTGCCCTTA ACTCCCACTC CTTGTATTTC GAAACAGAGG 
GCATCACGAT ACCATCTATG TCGTCTGACA TAACGAGCAC TTCAAAACCC TGcAAACGAT 
AAGACTCTGC ATGGGGAGAC TGACGCAGCA CACGATCGTC GTTTCCCGCA ATGTAGTATA 
TCGCCTTTTG ATCC GGTTTC ATGCGAGAAA CGTATTCGGC GAAcTCGTCC ATCCGTCTTC 
TGGAACAGAC TCACTTAGAG TCCTGAAACG AACAAGTTCC AGCAGCTGCT CACGGTGCTC 
GTAGTCGCTG TATAAACCCT CCTTCAAGGG ACGATTATAC TGCGTGATAA ACTCATCGTA 
CTTTTTCCCG TCACACTCCG CGAGTCTCTT AAATTCCCCG AGCAACTTTT TCACCGAAGC 
CGACTTGATT GCTGCAAGGA CTCTATTTTG TTGCAGAATC TCACGGCTTA CATTCAGGGG 
CAGATCTTCG CTGTCTATTA CACCGCGGAC AAAACGCAGA TACACTGGCA ACAG TTCCTT 
CTCGTCATCA GTGGATGAAA ACGCGCTTAA CGAATAGCTT TACCCCCGGC TTATAATCTG 
ACGTGAAAAA GGTCAAAAGn GCGCTTTTTG CCGGGCAAAT AAAAGAGCGT nGACGTACTC 
CTGTGTA 

(2) INFORMATION FOR SEQ ID NO: 62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4354 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 



7020 
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(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 
CTCTTCAATA ATGTCTTCCA TGCACGCAAT ACCCGAAACG CCGCCGTACT CGTCCACCGC 60 

GATCGCAATG TGCACGTGCC TGCGCTTAAA CTCTCGCAGA AGACTGTCAA TTCGTTTGGA 120 

CTCGGGGACA AAGAAGGGTT ACGCAGCAGT CTTTCTAACC GCACCTCCTG TGGCCTTCCA 180 
AACAGCTTTA TTAAATCTTT GACGTACAGC ACACCCACCA CATTATCAAT AGTTTGTTCG . 240 

TAGACAGGAA AGCGTGAGTG TCCACTCTCG GTTACCTTTT CAACGAGTGT TTCACCGCTC 300 

ATAGAAAGCT CAAGAAAATC CACGTCAATA CGCGGTATCA TCACCTCGCG CACCGAAGTG 360 

tCAGAAAGAT CCACTATACC GCGGATCATA TCCTGCTTTT CTTCATTCAG CGGTTGCTGA 42 0 

AAAATATGGG TAACAGCGTG CCTGCGCCTC AACCAGTCTA TGACTCCCAT GGTATACCCG 4 80 

ATGATAGCAC CCGAC AGTG t GCGCCAGTAT GCGCTCCTGC AAACGCAACA TCTCTTGACC 540 

GGGAGAATTG TCCTGGTGAT CCATACCGCT CAGATGCAAA ATGCCGTGGA TGAGCACCCG 600 

TTTAAATTCC TCGTGCGCGG CAACGTGAAA ACGTTCACTG TTTTCACGCA CACTTTCAAG 660 

ACTGATGATA ATATCACCAG CAAGAAAAAA ACGCGTCCCT GCGTCATCGC AATACTCACC 720 

ATCGTTCTCA AAAGACAGCA CGTCAG t GGG AGAATCAATA CCACGGTAAT CGTAATTTAG 7 80 

CCGGCGAATA AACGCATCAG TGCAGCAGAC AATGGAAAGA TCCCAGTGGG AAATAGCCTG 840 

GGAATCGAGC ACCGCACACA CAAACGGCGC AACTTGACCA ATCCAAGGAG GCGGACAAAA 900 

GCcTTCGCAG GAAACAGAAA CTTTATTCAC CTCGGACATA AAGATTACTC CTTATACGAT 9 60 

CCTTGGGCTA CGGACACGAG CTGCTGCTGA TCAGAATCTC TTTGGTGAGG ATACTCTATG 1020 

CGGGAATGGT AGTATCCTGC CAGTATTCTC ACAAAACACT CCTTGACGAC CTGCACATCC 1080 

CGAAACGTTA AATCAGAATT GTCAAGCTGa TGTGTTTCTA TCTTCTGTTG CACAACCTTA 1140 

TCGATAAATT TCCCTAGGCG GGGGATCGTC GGTTTATTCA ATGTC CT AC A TGACGCTTCA 1200 

ACCACATCAG CAAGCATCAC CACCGCAGAC TCCTTTGTGC GAGGAGGAAC CCCCGGATAG 12 60 

GTAAAATCTT CCCGATCAAC ATTCGGATCG AGTTCCCGCG CCTTCTCGTA AAAGTATGTA 1320 

ATAAGACTAT TACCGTGATG CTCTGCAATT ATATCGATAA CCTCCTGAGG TAAGCGGAGT 1380 

TGATGTGCCT TTTCTACCCC CAGCTTTACA TGACTCCGAA TTACCGTTGC AGAAAGCCGT 1440 

GGATTTAAAT CTAAGTGTTT GCTATCGCCC GTTTGGTTTT CTACAAAGTA CTCACCGTTT 1500 

TCCATTTTTC CAATGTCATG ATAATACGCG CCAACTCGCG CAAGGAGCGA ATGAGCCCCA 1560 
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ATGCTACGAC 


ACGCATTTTC 


TGCAAGAGTG 


GCAACCATCA 


TGGTGTGATT 


GTACGTACCT 


1620 


GAAACTGTAA 


GCAGCATTTT 


TTTCATGATA 


GGAACGTTGA 


GGTCCGAAAG 


CTCCATAAGC 


1680 


CGGAACACGG 


TAGGAGCATT 


GGTGAGCGCT 


TCAAGGATGG 


GCAGAAGGCC 


TAACACCAAA 


1740 


ATGCCGTTGA 


GAAAGCCACT 


GATCGCCACG 


CCTGTAAGGA 


GGAATATTGC 


GTCAGTGTAC 


1800 


GCATGCGGAA ACGCAAACAT GAGCGTAGCA 


GnCAAGGAAA 


GGCTGAGCAA 


CGGCAAGGAC 


1860 


ACAGGAACTT 


TTAACAATGT 


CGAGCCGAGA 


GCTCATAACA CGCATACAAG 


CAGACGCCGA 


1920 


CACCCCAGAG 


AGGAGCGCAA 


AAAGCGTAGG 


CTCaGTATGG 


AACTGTGAAG 


CGATGAGCAC 


1980 


TGCGAACGCA 


ATGAGAAAGG 


AACTAGTGAC 


GGCACTACGA 


TGGGAAACGA 


GCGCGGTAAC 


2040 


GAGCATGATA 


CACAACGCaG 


TTGGCTGAAA 


AGGAATGCTA 


TCcAAGCGGT 


GCAGGGAGCG 


2100 


CAGCTATCTT 


TGAAAGAAAA 


AGTGTGCACA 


GGTATCCGGC 


AACGCTGGTA 


TAGAGAATGA 


2160 


GTAACTCTAC 


ACGCAgTTTA 


AGAGGAGGAT 


GGGCCATCCG 


TTTACTGAAC 


AAAAAGAAGG 


2220 


CAAGCAGATA 


CAAAAAGGCC 


AGTAACAGGA 


GACTGCTTAC 


GAGCAGGGAG 


CGATCGACAG 


2280 


ACAGTTTAGA 


GTGTGCAAGT 


GCCTGCAATC 


TTGCGTAGTC 


AGTGGCGGAT 


ACGATAAAGC 


2340 


CGCGACGGAC 


TATAATTTCG 


TTTGGATGAA 


TACTGAGGGT 


GACCGGTCGT 


AACCGCGCCA 


2400 


ATGCGTTGCG 


GACATGTCGT 


TCACTTTGAA 


TAGGGTCAAA 


GACAATATTT 


GGACGCAGAA 


2460 


AGGGTCCGAG 


GGATGAAAAG 


AGGAGCGCCG 


CCTGCGACGT 


AAGACCGAAA 


TCGGAAGCCA 


2520 


GCGCATGGAC 


GCGCGCGGCG 


AGTTGATCGG 


ATCTGATGAG 


CGTTTCAATT 


GCCTGAGTCC 


2580 


TCGAAAAGAC 


ATCCCCTGCG 


GCATTTTCCG 


CCTCATCCGC 


ACTCGCATCG 


GGTGACAGGG 


2640 


GTACAGACGC 


AGAAGGGAGA 


TCTCCTGCTG 


AGGCTTGCGA 


CGCCACGGGG 


GCCGACATAT 


2700 


CCGACGGGGC 


GTGGAAGGGA 


CGTTCCTCAC 


TGATAGTAAT 


TGTGTGGGGG 


TTAAAATCCT 


2760 


TAAGCGCATG 


GTCGGACAGC 


TGCACCACAC 


CTTGCGCGAA 


GATACGCGCG 


AGCACTTGAG 


2820 


TTCCCACACG 


CAGGAGGGAC 


TCAAACGTGT 


CATCGTCAAG 


CTGAAGCAAG 


GATCGCAGCG 


2880 


TCTGGCGCGA 


AAAGTGAACA 


AATTTCTGCT 


GCAGCAGGTG 


CACGTGTGCA 


GACGCCCGAT 


2940 


CGTGCGCAgC 


GTGGaGTGAC 


CGCTCCTCCT 


CGTAGTCCGC 


TGCTCCTCCG 


CCTCCGTCCG 


3000 


ACGAGGTATC 


CAGAGCCATA 


CCAACGCGCG 


CTTTCTGCAA 


CGCATGACAA 


AACGCCTGGT 


3060 


ATGCGCGTAC 


TTCAGCCTGT 


TCCAGATCGA 


GCCGACGCTC 


AAAAACAGCA 


GGAATTTCCT 


3120 


TCTTCCTGCG 


AGCATACTGC 


CcGCTGGGTA 


rCCAGCTCAT 


CAGTAAGGGA 


AAGAAAGmCA 


3180 


GGAGAGACAA CgTTCCgcTC AG t AC ACGCC 


CTACCGCAAA 


TCAGCAAGTT 


CAGTCTTGCA 


3240 


GGGTCTTGCT 


GTTCGCTGAT 


GCTCACCGCC 


TTGGCAATGC 


TGAGGACAAC 


GAAGGAGAGC 


3300 
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GCCAGATTGA GCGCGCGCGC ACCCgGCGGC GAAGTACGTG ACACAGCGTA TGCCACAATG 3360 

CGTAAGACGG GkTGGTCCTT TCTTCCTCAT GCGCACTTCT CGCGCCAGCG AGCrTaACAC 3420 

GCCAGCGGTA ATCTG TCC AG CAAGGACACA CGGACGCCTT GCGTACCCCA ACCGCGAGCC 3480 

TTGACAGAAC ATACCCAAAT ACCGCACCAT CGGCCTCCGC AATGAGAAGG AGTGCGACAG 3 540 

ACCGTGAAAG GATGCGCCGT CACCATCGAC CAGGTCTCAA AAGCATACGG TCACTGCCTC 3 600 

GCCGTTGACC GTGCCACCGT TCACATTCGG CAGGGAGAGT TTTTCTC CAT CCTCGGTCCT 3660 

TCAGGCTGCG GAAAGACCAC GCTTTTGCGT ATCATTGCAG GGTTTGAACA GCCGGACTCA 372 0 

GGAGACTTGA CCTTCGACCA CGTGAGTGTG CTCGGTGTTG GTGCAAATAA GCGGAGGTCT 3780 

AACACCGTTT TCCAGTCGTA TGC CCTCTTT CCTCACCTTT CCGTGTACGA GAACATCGCC 384 0 

TTCCCCCTCA GGCTCAAACG CCTCTCAAAG AACCTCATCg CGAGCGCGTG CACGAGTACC 3 900 

TTCACCTGGT ACAGCTGGAC GAGCACCTGC ACAAGAAACC CCATCAGCTG TCAGGTGGCC 39 60 

AACAACAGCG CGTCGCCATT GCCCGTGCAC TCGTGTGCGA GCCAGGGGTG CTCCTGCTTG 4020 

ACGAGCCGCT TTCTGCCCTG GATGCAAAAC TTCGCTCCAA TTTGCTCATA GAGCTCGATA 4080 

CACTCCACGA TCAGACGGGC ATTACyTCGT TTTTATCACC CATGACCAGA GCGAGGCTCT 414 0 

GTCCGTCTCC GACCGCATCG CCGTCATGAA CAAAGGAAAG ATCCTGCAGA TCGGTACTCC 4200 

CTACGAGATT TATGAGCAAC CTGCGACTGA CTTTGTCGCT AAGTTTATTG GGGAAACTAA 42 60 

TAGCTTCCTG TCAACTGTCG TCTCCTGCAC CnCCATTGAA AACGAAGAGT TTATGCTCAG 4320 

TCTCCAGGTT CCGGAACTTG ACCnTACGCT CACC 43 54 
(2) INFORMATION FOR SEQ ID NO: 63: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21948 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 

GATACTTCCC AATGGCACTT TcsGGTCGCt GcTTTTtCyT CACgTTaACA GCGAACGTAT 60 

TGATTTTAa T ATCCACCTGC CAAAAgGAGG TTCAtTACAG GACTATGCTC ACATCCGCmA 12 0 

CACACTCAGC CGCAGCGTTG CGCACTTCTA CCGTCAGTGC ACTATTGCTC ATACGTACGT 180 

GCAGAACTGC CCACGCACtG CCACTCAGGG CAACGCGCCA ACACATTCCT CACCCCCCTG 240 

C AC CGGCGTA CGAGAAGAAC CCGCCGCTCC cTGCGCGCAC ACACCCCGGT ACGAATCCCT 300 
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GTTCCCTCTA CCCGTGCAGC ATGCGCACCT GCTTCCTCCG TCACCTCCTC ACATCTCGTG 360 

CGAACACGCG CGCGATTGCA CTCACCCAGC CCCCGCTGCC GAAGGAGATG CGCCTGTGCA 42 0 

CAACCATACC CATACAGGTG CATTCAAAGT ACTCGGACAG GTAGCAGGAA CATTCATCGC 4 80 

CGTAGAACGC AACAACGCTC TCTACCTTAT CGATCAGCAC GCAGCACATG AACGCATTAT 540 

TTTTGATACG C T AC AGCGG A ACCTTGGCAC TGCACAAATA CTTCTTATTC CCTACCACAT 600 

TCACCCACGC TCGGATGAAG AGGCGCGCAT CATGCACCGC GCCTGCACAG AACTTTCTCC 660 

TGCAGGATTT CGATTTCACG AAGAACCAGA CGGTTCGTGG CACGTAACTG CGGTGCCGCT 720 

CCACTGGCGG GGGAGCGAAG AGCAACTTGC ACACGATATC CTCTACTCAG GAAAAAACGC 780 

GCACGACATC CTGCGCCACG TCCTCGCTAC CTGTGCCTGC CGGTCTGCGT GTAAAGACGG 840 

CACCATCCTG GATGACGCAA CGCTCCACTC GTTAGTGGAG CAGGCTTTTG CATTACCACA 900 

ATCGAGGTGT CCCCACGGAC GGCCCATTTG GATTGTCATT GGCCGAGACG AATTGTTCAA 960 

ACGGATCAAG CGCACGTAAC GCGCTGCAGA TACGCAAAAA GAAGCCTGCT ACGTCTGCGC 1020 

TCTCCGCGTC GGCACGGGGA GGTGCGCCGT GTGCACACAA ACACCACCAG TGAGAGGATG 1080 

TACGGCAGCG CAAACAGTAC AC CGGTGGGC ACCACGTGAG TGCCCTGCAA TAmGTc aCAC 1140 

ATGTGTTCAA TACCGGAGAA AAAAATCGCC GCCGGCACAC ACCACATCAT CCGCTTGCGT 1200 

GCAAGAAAAA CAATTGCAAG TGCCGTCCAT CCTCTGCCTG CAGCCATCTG CGGGGTGTAg 1260 

TACCGACACG CAATACTAAC AGTCCCCCCG CACACACCGC ACACACGCCT GTaCCGCCCA 1320 

CGACACCATC CGAT t ACGCG CCGCGTCAGT TCCCCGCACC TGCAAGGTAA CGCACCTTCC 1380 

CCrkAGTGCA TAAAATTGAT ACCCACGTTT GTAGAGTACA GATACAGGTG AAAAACCCAC 1440 

ACCAGTGCAA AGGCCACCGC AGTCCCCCAC AGGGGGTGAG GTAAAACGCG GGTATGTGCA 1500 

AGAGAAACAT GAGTGAAAGA GACACCATGC GCTGCCGTGT CCATCTGCAT CGCAGAAGCT 15 60 

GCAGCGCGTG CAAACATGCT GGACGCACCA AATGCGC TC A TCCCCATTGC AGAAAAGTGC 1620 

ACTGCTATGC CCGTTAAAAA CGGATTTGCC CGCATACGCT CCGTACCCAC GGCCACAAAA 1680 

AATAAACACA GCGGCACCAC ACACACGGTA ATACCCAGTC CACCCCAATA ACTTCCCCAT 1740 

ACCAGTGCGA AAAACGCTAT GCAAAAGGAC GAGAAGGTAA TCACCCCTTC CATAAAAATT 1800 

CCCAACACTC CCGCGTATTC TGTTGCGAGC GCTCCTGCTG CAGCGCATGC AAGCGGTGct 1860 

GCGCGATGTA ATATTGCTAT CACTGTGGTG CCTATCACTC CCATCGAGAC CGCCTATGAT 1920 

GTGTATCATG TACAGAAAGC GCATGACGTC TGCGAGTTCT GTGCTTTTCC CCACGAAAAC 1980 

AAAAAACGGT GACAAGGAAT CGATATACCC GGCGTGCGCC TCGCCGCACT GCATTCCACG 2040 
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GTGCGGACCA TTGTGCAGAA ATAAGCAAAA AGATCGCCGC CTGTAAAAAT AGCACCACAT 2100 

TTACCGTCAG gTGCGCACCA AGTACTGCAG CTTCAGAGGC TGTCTCCATC CACGCGAAAA 2160 

AGAACGCAAG CGGTACGAGT ACCGTAATGT GTGCATGGGC AATTAaCGCG TGCGCTAAGG 2220 

CTGCGTAACC CATCCCCACA GAAAAACCCA CATAGCAGGT GCCAAACAGC CCAACTACAG 2280 

AAAAAAATCC GGTAAGCCCA AACAGCGCCC CTGAAAGCAC CATTCCCCAC ACATAGGTGG 2340 

CCCATACGGG AAACCCTACA AAACGCCCAA ATTCGGGGGC CTTTCCGCAT ATGCGAAACT 2400 

GATATCCTAC GCGGGTGTAC GAAAAAAAAC ACCCAACTGC GAGTGCTACT AAGGACGCAT 2460 

AGGTCAATAC GGCCGGCACA CCGAACAAAG ACGTCTGTTG CTGCAATATA AAATGCGAAT 2520 

GAACCGGCGC AGTTGCCAGC AAGTTCCCCG CAGaTCACGC GTAACCGTTA TAATCAACGC 2580 

ATCGATGAGA GGCACGCATG CGGTGGATAA CAAAAAGGAA GTAATCATTT CGCTAGTTGC 2640 

CAGCCATGCT TTTAGTATCC CAGAAACACA GGCTAATATC CCCGCGACCG AcAGCGCACA 2700 

GAGGAGCGCA ACACTCCATT GCAACAAAAA GCCCACACCC CAGTACTCAC GGAGCAACAA 2760 

TGCGGTGACA AAACCTGCAG CATAGATCTG GCCATCACCA CCTAAATTGA TCATTCCTGT 282 0 

TTTTAGCGCG CAnTCGCCCC CAGTGCCATA CAGACAAACA GTCCTGCTTT GTGAAACAGG 2880 

GCACGTATGT AGCCACGGGT AGAAAAAGGT TTGAGAAAAA ACGCTGCCAA AGATACGGAT 2940 

GGATTTTCCG AGCACAGAAC AATCACAGCA CTCATAACTG CAACACCGAG CAACACTGCG 3000 

ATACACGAAT TGATCACCCG TTTCACGTAT GAGAATC CTG AGACGGAGAC GGAGTGCCTG 3060 

ACACTTCAGC ACACAACGTA CCTGCACGTA GCAAGAAACG TTCTGTGCAC AACGCACGCC 3120 

ACTGTGCCTG ATGCTGTTCT CGCGCAAGGA GCACAAGAgc AGTTCCTGCC TGTGCTACCT 3180 

GGCGCAGACG TGCAAGCAAG CGCTGTTCAC TGGCGCTATC CAATCCTTCT GCAGGTTCTG 3240 

CCAAAATGAG AAGACGTGGA CGCGTTGCAA GctCACGCGC TAAAATAACG CGCTGCAACT 3300 

GTCCGCCTGA AAGCGTACAG GCAGGCTGCA ACGGATCGCA GTAAATTTCT TCTTCTGCAA 33 60 

GAAGACGAGC AACAAAGCGC ATCTGGcgCg GCACACGCGT GCGCCACGTA CGCAACGTGT 3 420 

AGGGAACGAG CAAATCAAAA AGAGTTAACT GCATTGAGGC ACCGCGCTGT ATGCAATTAG 3480 

ACGGCACACA CGCAACCCCG TGTGCCCGCA GCAGCGAGGG CGTATTGCGC TGGAGGGGGA 3540 

GACACCACAC CTGATCGTGC TCCTGCAAAA GAATATTCCC GGTGCAGTGC GTACGCGACG 3600 

CCCCAGCGTG CATATCACAC AGTATATCTT CCAATACGTG CACACCATCT TCTGGCGTAC 3660 

CGACTATCCC TATGATAGCA GATGCGGCCA CAGAAAACGA AATATCTGTG AGCGGAACGT 3720 

CTGCGTGTTT ACTCACCTGC AGCGACTCAA CGCGCAACAC CCAAGGACGA GCAGAAGATG 3780 
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TGCGCGGCAC AGTTGCGCAC GACTGGG TAT CTGACAGAGA GGAAAAAGAA CTTACGGCAG 3840 

AAGAAGTCAC CGTTGATGCG GACATGAGCG CACAGGACAC TTTCTGAATA CATTCATTCA 3900 

CCTGATGCGC AGAACAGTAT TCGTCTAAAA GATCCGTACG CAGAAAACTG CACGCTTTTC 3 960 

CCCCTTCTAT CAAAGAAATA CGCTGTGCCC ATCGCAATGC ATCAGCAAAT CGGTGCGTCA 4020 

CTACTATCAC TCCGCCACCA CACCGGGGCG CGTGCGAAGA ACGCACAAAA AACTCTTCAA 4080 

GATGAGAAAA GAAAACCGCA CGCGATTGCG CCGGAGCACA CCGCGGCTCA TCCAGGATGA 4140 

TGAAACGCGG ATTGCGAAAC AATACACAGA GCAACGATAC AAAAAACCGC TTGTCTGCAC 4200 

TCAAACATGC AACGTATTCT TCCTTCTTCA AGGGCATACG CCACTGGGCG AT AATGC GAT 4260 

CTATGCGTTC TCTCACcTGT GCACGGCGCA CCCACCGCAC GCCGGTgAgT GCAGCACTAC 4320 

CCATCACTAC ATTTTCAAAT ACTGTTGCGC GTTCTGC AAA TACCGGTTGC TGGTGCACTA 43 80 

TGCCAATTCC TGCACGGAGC GCATCGAAGG GTACGGAGAA GCGCTGCTCC TTTCCATCCA 4440 

GACGGAGCTG CCCATGCGTC GGCACGCAAA AGCCCGAAAG AATATGCGCA Ac GTGGATTT 4500 

TCCTGCACCA TTTTTTCCCA ACAACGCGTG AATTTCACCG GTAAAAAAGG AAAGATTCAC 4560 

ATnGCTGAGC ACGCTGTGCT CAGGrCsGTC CGTCTCACGC GCGCCCGAAC ACGGGCCATG 4620 

CGCTGTGTGC GCATCGTCGA CTGCGCGCCT GCCAGGGTGA CCGAACATAC CCCAGACCCC 4 680 

GCGCTTTGAA CGCGGCATCA CGCGCGGGTA GGTTTTCCCA ATATGGTGAA GCGAGAGCAC 4740 

ACCGCGCGCA GATGCCCTAA CGCCGCGCTC AGCTATCATC AACGCACCGG CAACGTAAGC 4 800 

TCACCGCTTT GAATACGCCT GAGCAACGCA GACTGCCGCA CACGAATCGG TTCGGGTACc 4860 

GTTTGCAGGT ACAAGGGATC CTCTTCAATG AAACGTACGT ACCCGTCTTT CACCCCCAAT 4920 

GTCCAGGCTC CTGCAGATGG CAGTTCACCG CGAATGCAGC GCAGtcTGCT CATACGCAAG 4980 

ACGCTCCTGT TCCATAACGG AACTGCCAAC TACGTAGCCC GGTGCCCTCG CATAGCCGTT 5040 

ATCGTCAAAC CACGAAACAT AAAAACCGAG CTCCCGCGCG GCCGCAAGTA CTCCCTGATT 5100 

CGCACCGCCG CAAATTGGCA TCATAACATC CACCCCTTCG TGAAAGAGAA TCCGTGCGAG 5160 

GTCTGCACTT TTTGCAGCGT CATACC AG TT CCCCACCACG CGCACATCGA CTTCAAAGGC 5220 

AGGATCTACT GCACGGGCAC CTGCGAGAAA GGCAGGAATA ATAGTCTGGG TCATCACCGG 52 80 

ATACGACTGC CCCGCAATAA GACCGATTTT TTTATCTGCA TTTGCAAAGC GCATAGCACT 5340 

CGCACTCACT AACGCGGAAA GGTGTCCTGC AAGGTAGGCT TGCTCCCACT GGTTATAGCG 5400 

AAAGGTAATC AGCGAgTGCT CtGCGGCGCG TAGGCATCTA GAACCAAAAA CCGCTGCAGG 5460 

GGAAATTGAC GCAAAATAGG CTCAAGGACG TGCGGGAGTG CAGGGTTGGA AGACACAATC 5520 
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AAACGATAGC GCTGTTCTGC AGCAAGATGC GCCAACTTTT CGCGCCAGAG CGCCTGGTTC 5580 

GGCCCCGCTT CGATGATATC AAGCCCaATG TGCGCCCTGT CGCGCGTTCc TGCGTAACTG 5640 

CACGCTCAAC ACCGTCACAC AACATTGCAT ACACAGGACT GTCGTGACGA AAACCTGGGA 5700 

CAAAAACGGC AATAgCACCG CGCGCTCATC TTGCACCGCA GGCCTACACG AAAAGCAAGT 57 60 

AAACACTGCA ATGAGCGCAC TGAGAACACA CACCGCACCG TTCATAACAC CTCCCCCCAA 5820 

AAATCCCTCT CTCGCGTAGG GTGCACCcTA CCGGCACCCA CAGCCTGAAA AGACCAGAGC 5880 

ACTACTCCTC ACCCCTGCCC CCAAACGCAT TGCACCACCC AGAGAGAAGG AGAAAGACTA 5940 

GTTTTTCACG CGTTCTTGAA AAACGTAGCA TCCGC TAAAC TCTGCGATCT CACGGATGCT 6000 

CTTCATCCAA TAT AC CGCTT CTGTACGATT AGTAAAAGGA CCGACACGCA CACGGTGACG 6060 

CAAGCCTGCA CCCGTGCGTT TCGTGAAGAT TTCTGCCTTC ATGTGTCTAG CTGCAAgCAC 612 0 

ACCCCGAGCA CGCTCGGCGT TGAGCTTACT TGAGAGCGAA GcGGCTTGCA CCCAGAAAAG 6180 

GACAGAAGGA GCAGCAGGCT GCGTGGACGC ACCGCGCACG CGCGCATcCC GCTGTGCAAC 6240 

AAGCGGAGCA CGGGTACGAG TAGCGGGAGG CGACTTGGCA GACGC TCGGT CACTCCGTGC 63 00 

ATG CTTTGAC GCACCCTTTG ACTCTGCAGA TTCCCGAGTG TCAGAGGCAG GAGAAGTTTT 63 60 

CCTGTCCTGC GCCGCGTCTG TGCGCTCAGC AcGCGCCGGA GGAGAAACAT CAAGACTTTT 64 20 

CGCACGAGCG GTAGGGACAT CCTTTACCAC CGTGAGATCA GGAATTGCCC TCTGTGTTGG 6480 

CGCAGGAGTC TTCCCCAGCT CAGGAATTTT TTCAGGATTT TTTAGCCATA AGCTCGGATC 6540 

CACTGACGGG TGCTCAGGTA CGTCACCACG CTCAACATAA GAGGACACGT CGGTCACTCG 6600 

TGCAGAATTA GAAGAATAGG TAGGAGAGTA CAACAAAAGC GCGACGCCAA AAATAATGAG 6660 

CATAAACACA CTCAGGGAGA TGACAATCCA CAAAATTCTC TTCTGTTCCA TATTTACTCG 6720 

CCTAAAACAC CCCTGCGCGT CAGAATGCAA TCGACTTTTC GCGCCAATGA CGCAACACGC 6780 

CCACAGTTGG CCACCGTGTA GGTATCGGCG TTTTGCGCGA TGCTTTGAGC ATAGAAACCC 6840 

TTCTGCGCAG AAAACCTGCG TAAAAGGTGA GTAAAAGATA CTCGTTCCCG TTTCTTACAT 6900 

CGCCACATGC GCAGTATGCT TGGCGCCCAT ATGTACAAAA CAAAACTACA GGCTTGTAAC 6960 

AGTTCTGTTT TATGCAATGT CGGCGCATTC AGTACTATCG CTTTTGGACG CGCAsCTGCG 7020 

CGCGCGCTAT GTCTTCACAC AAAAGACGGG TAACTTTCGG TAACAAAAAC TCTTCGTGCT 7080 

GCTGCAGTAG TTTAGGCTCA GAGAACAGTA ACACACCCAA ATGCGCAGAG TGCAGGCCGC 7140 

CATCTTTACG CCGTAACGAA AGGCCGCGCG CAGCgCAGCC ACTTGAAACC GCTCTACTAT 7200 

AGAGTCGCCA TAAGATTCCA AAAGTTCAcG CGTGCGCGCA TCCGCGTCAA TACAGTAACA 72 60 
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GCGAGACCAC ATTCTTCCCC GCACCACTTC GACCGATGAC 7320 

CGCGCACAGC GAGAGCGTAA CGTCAAACAC GCACCGTCCT 7 3 80 

AACACCCGAT ATTCCTGCGT ATCTTGCGCT GACTACCCCT 7440 

AAAAGACGCT TCTGTGCATA CTGTTCGTGT CTGGCACGGT 7500 

AGACGACACA GCCGTTGCGC TGCTGTTTGC ATATCCGCAT 7560 

GCGAGCGCCG CGTTTCCTGT CAGTTCTGCG TCATGAATTT 7620 

CCACTGACAT CAGCCTTAAG CTGCAACCAA CGACTATCCT 7680 

GTGTATACCG GTTGCAACTG GGTCACAGAC TCCAGATACT 7740 

AAGGCAAGAT CCTCTACCAG CTGTCTCCCT TCTCCCACCA 7 800 

TGCAGCTGAA AAGGTAACGC CATAATTTGA CCCATCCGTT 7860 

GTGCGCCGCA TATAAGAGGC AAAACGACTC CCTGAATCCG 7 920 

CAAAAATCTG CTCTCAAAGA CGGTAACACG CGGACCCTTG 7980 

GGAGGCAGAC GAACACATAC ATTTAACCCC TCGCTGGATC 8040 

CCTGCATGTA ACGTATTTGT TCCAATGAGT GCTGCCGCAA 8100 

ACCGGTATTC CCCGATAGGA GGCAACAATT GAGCCAGGAG 8160 

GTTTCAGGTA ACGCACACGC GCGTAAACTC TCAGACGTCC 8220 

CGCTCTGGTA ATACCG TAAC CGCACACCCC GTGAGTCGAT 82 80 

GAAAGAAAAA ACTGTACATC ACGCGCACAA AAATGCAATC 8340 

TTTGGCAAAA AAAGAGAAAC ACCGCACCGG GGATCAGACG 8400 

ATCAGTTGAT CCTCGGCATG ACTCTTCTTG TGCACGGCAA 8460 

GAAATAGTAA TGGCAATAAC GTGATGCACG GCACGCAACC 8520 

GAACGCACCC AATCCTGGGC CTTCACCGGC TGAGGAAAGA 8580 

ACC TTTCC AT CTTGGGAAAT AATCGCCGCT TTTAGGGAAG 8640 

AAGATCCCCG AACCCATGTC CGTTGCGACG CTACCCTATG 8700 

GACAAGCGAG CGTGTAGGAG TCGCTCGATG ATACTGCGAT 8760 

GATTGATTAT GATGAanTGC GCGcaTGACG CGcGCGAACA 8820 

GTATACCACT GCTTGTGCGA AAGCTGTGTG TGGAACACCG 8880 

GAGAAATACC GTTTCTCGTA CGCTTCATCA AATAACTCAA 8940 

GGTAAACTCA CTGCCGCGAT CACCTGCTTT GCTCCGCGAC 9000 
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TTTTCAAAAA TTCCATCGCC TTTAGCATCG TACCTCCGCT GCCAAGCATA TCGTCAGCAA 
TAAACGCCGT CTTCCCCTCC ACATCGCCGA GCAAGTTAAT TTCTACAATA TTGCTCTGCT 
TTGCATTTTG CGCGACCACC GAATAATCAC GCACCTTATA AATCATCGCG AGTGGCTTTT 
TTAAACCAGA AGAATAAAAT TTATTC CGTT CAACCGCCCC GCTGTC CGGC GCCACTACTA 
CAAAAGGGAT ATCGGGGTCA GAAAGATTTT CAATCTTTGC CAACTCCCGG ATAATCTGAT 
AACTGGCGTG TAAGTTTTCA AGCCGCGTGC GATGAAAGGC ATTTTCAATC TCACGTGAAT 
GCAAATCAAG AGTGACAATG TGACTCACGC CAAGATACTC ATATACACTC CCGAGCAAAC 
CCGCCGTCAG TCCCTCACGT CCACACTTTT TGTGCTGACG GCTATACGGA TAAGTGGGTA 
AAACCAAGGT AACGCGCCCA GCTCCCGCGT GCCGAACTGC ATCTATGGTC ACAATGAGCA 
TCATCACGTG ATCATTCACG GAGAATATTT TTTTACTTTT TCCGTTATTC ACCAGGACTG 
GTTGATGATT TTCTACATCT TGGAAAATAA AAACGTCCTT GCCACGAATA CATTCATTAA 
TTTGCGTTTT TAACTCACCA TTTAGAAAAC AGATAAACTG TGCATCCACC TTAAAATGCG 
GTGGATTAAA ACGACGTACG TCATCGTGCG CACACAACTC TGTGGAAAAA AGGTC CCGGT 
AAAAGTTCGC ATCTCGTATC ACCGACCCCG AGTCAAGACC GTAGCGGTCC GTCAAACGGT 
CCATTCTCTG GTGAAACTTG CGTTCACACA CACGCGTCAA ATGTTTGATA GTTTCGTCCG 
CGAAGTGCTC GCCACCAGGA CAGGCGACGA TCGCCAAATC AGTAAACCCT GAACATCTCA 
TGCAATCTCT CCACACTTGT CAACGCGGAC TGGACAAGCA CCGTCTCAAC CGCACGATAG 
GGAGCCCATC CGCGCAGGCT AATACAACGA AACACACCCA AGCGAACACA CCTTGCATAC 
GCAAGGCACA GCGAACGCAC ACGCGTCATG CGCACAGGGC AACACTTTAC TTACTTATGA 
TAGTGATTTC TACCCGTCGG TTTTTTCTAC GACCATCCTC TGAATCATTT GGCGCAATAG 
AcTGctGCGC ACCACAACCG CGCGTATATA CATGCGCTGC ATCCACAACA CCTAATTCCT 
GCAGGTAACG TGCAACCACA TCAGCACGCT CTTCAGAAAT CCTCTGTTGA TCCTGCACAG 
ACCCCCGTCG TGC CGCATGT CCAGACACCA ACAACTCTCG ATCGGGAAAC GCGCGCAAAA 
GTTCTGCTAT TTTGCGCAGC TTCTCGTACT CAGAAGGTGC AAGGGATGCA GAGTCTGCGT 
CAAATTGAAC ATTTTCTATA CTGATGGTTA CTCCCTCTTC TGTTTCACGC ACcTTCGCAT 
CAGGCATATG CAAGTCCTTG AGCGTTTCTT GCAGTTCCAC CACCGTGCGT GCAGGATCAA 
AGCGCTCAGG TGCAAAATTC TTTGCCGTCG CAGTACCCTG ATATCTCAAA ACCGTTCCTC 
CACTCAGGTA TAAGAGAATG CGAAACTCAT CGTCGTACTC AGCTATGTTC CCAAGTTCGT 
TATCCCAATA CAAATTTTGC TTAGAAACGC CCGTAGTACG CACCGGATAC ATTCCTTCTT 
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TTGCATTGCG CTGCACGCCA TGTGTCCGTT 
CAGTAATGTG ATGATAGCGC CGACTACCAC 
CGGTAAACGG CACAATGAAG GGCGTTTGGA 
CTTCTGCTTC ATGCTCCCAG GTATCTCCAA 
CATTC CGTAC TACCGGCATG AAGAATGAAC 
GCCAGAAAAT ACTTTCGTAG TGCCTCCCCC 
ATGTCATGAA GTGACACACA TACCGCGCCG 
CTTCGGAAAC ATGCACCGTG ATTCGATTCG 
TCACAAACAC ATCCTCGCGT ATCAGCGAGT 
TGTaGCGCAA GcgCAGAGGG TACGCCGCAC 
GCATTCCCTT CCTAGAAAAC ACACTTCCCA 
ATTCCGCACG AGAAGTTGCA CCCCACCCTT 
ACGCCCTATC AGCACCTGct GCGCACCCAA 
GCGAATACCA CCGTCTACCC ACACTTCACC 
AAAAAGAAAG TCCgCTGTGC TTCCGCGCGC 
GATAGCGACA TCCGGTTTCA ACTCTCTGAC 
TTTCACCACG ATAGGAAGTT TTGCAAAACG 
TTTCTCTAAC TGCACTTTAT CTCTCATGGT 
CACAAACTCC GCAACGTCCC GACCCCACTC 
CGGTTTGATA AACACTGCAG CCTTCTTTTT 
CAGTTTGATG TCCGGACAAC CATCTCCCAC 
TTCGATGAGC CGATAATAAA AC GAG AC C TC 
CGTCATAGGC GCAAGACGCA CGACCGGTAG 
TGCAAGGCAG TTCGCGATAA AATTCGCACT 
ACCACGACAC CCGTATCCAT CACAACGGGC 
AGAACGCACC ATGCGATCTT CCTTTTCCCT 
CCCATAGCGC TTCAAAGCCT GTGCAAAACC 
TGCGTGTTTT TTTGCTTCCT CATGACCATT 
GAACATGACG ATATCATTCC TCTGATCACC 





13041 
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TGGGCGACTC ATAACTCATA GAATACGCCG 10800 

GCTGAACTTC ACCACGGTAG GTATACGACA 108 60 

T AC C AAAGC C GTCACGCAGA TCATGCGCTT 10920 

CTTCGATATC ATAATCAGGA AACACCGGCA 10980 

GATCAATATC ATACACACCA AATGCGTCGC 11040 

AACGAAACGT ATTATTGGGA CTTTTCTCGG 11100 

CATCAGGCGC CGACCCGTGT GCAACAckTA 11160 

TAATCTCCGC CGTGTGAGCA AGCGTATCGT 1122 0 

TGATACGGTG CGTATCCCCC TTACGAAACT 112 80 

TTGCCGCGCA CCAGACTGCA AGAACACACA 11340 

TCATACACAC CGAACGGACC TATCCGTCAT 114 00 

TCCTCCTTTG AGAAGAGCTG TGATAAAAGG 114 60 

CTcCCstGct GCGCACAAAT GCGCATAACT 11520 

GGCACAGCGC GCTAACTCTC CACCGTATTC 11580 

AGTTTCAACC cTGCCTCCAT GATTTGATAC 11640 

TAATTCGACA TCGCGCGGCG CAAAAATTCC 11700 

TCTGACTGCA CGAAGGTGsG TAGGAGTCTT 11760 

CACAATATGG TACGCATCGA TATCCACACC 11820 

GATACGTTCA AAAATTTTTT TGTTCACATA 11880 

GAAGGAACGC AACGCAGCAA TACCCGACTG 11940 

ACTCAACAGG ACACCcGTTC CGGACACCGC 12000 

GTCAGGATAT CCGACGTTTT CAACTGCACC 12060 

TTCATGCCTT TGCGGAACAT ATTTTTTCCA 12120 

GTTAAAAACA CCCCCCATCC CTGGTAACTG 12180 

ACACAGGCGA CACTTGTACT CTGAACGCGC 12240 

AAAAAAGCAC TGCCCTCTTT ACTCTACCAC 12300 

TGCATCATCA TTAGATTCGG CAATATACGT 12360 

ACGCATACAA AAGGAAACGC CCGCTGCTTT 12420 

AAACACGCAT ACTTGTTCCA AAGAAATGCC 12480 
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GTAGTACTGG CACAATATAT 
AATCAGGTGC GGCATGGAGT 
TGCTGTTTCA AGATCTGAGG 
CCGATCGCCT GATAGTGCTT 
GGCCAAACGA TTGTATTCAT 
ACTGTCAGAC GTATAAACTA 
AGCAAATGCT TCAGCAGCAA 
GATCATTCCC CCGTTATACC 
CTGCACCATA GGCAGCATCC 
AC ACTTT AC C GCCTGCTTAT 
GTCCATATCG GTAACC AC C A 
TGAACAAGTG CACACCCAAA 
GTCTCCTTGC TTTATCGCGT 
TGCCATCAGC ATCTTCCATT 
CTCGGGATGA TGACCAGTCA 
CTTTTTTACC GTGCGCGCAT 
AAGATGCGGA AAACGCATAC 
ACCTCCAGGC ACTAAAAGAT 
CTCTATCACA AACCACAGGG 
TTGTTGGGGA CGATTCGTGG 
TAATGGAAAC GTTGCAACAC 
AAATTCACAG ACATCAACCC 
AGCGCCATAT GCACAGCCAA 
ATCTGCACAA AAAACTTCTG 
CATTCGAATT TGGTCAAAGT 
AAAAAAATAT GACTCGGAGT 
ATCGCATTCA TTACAGCGCT 
TACCATTTCT CCACAAAAAG 
ATCTGCCAGA TCACGACACT 
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GCAGGGCATT CCCTTTATCC ACTCCCTCGG GC ATT AC AT C 12540 

GTTCACAGTG CACTCCATGC AAACTGCCAA TAAACTCAGC 12600 

GAGAAAAATT TTGCACAAGG ATTTTCATAA CCTGTCGCTG 12660 

CTATGGGAAA GATAGGGATA AGCGCGCTCC CCCCACGACG 12720 

GGAACGCCCG TATACGGACA CTGTAGTCCG GTGCATATAC 12780 

GGTAATCTAG GCGATGCGCC ATGCCGTATT CCAAAACACG 12 840 

AACATTTTTG AAAAATAGTT TTGCCCGTCT TAATTTCACG 12900 

CAACAACCGG TCCAACCAAC TGCAACTGCC GCACATACTC 12960 

TACCAGTTGC AATAACAAAA GGGATATTCT TCTCGTGCAG 13 020 

TCTCAGGTGG AATGTTATTT TCGCTATCCA AAAAAGTACC 13080 

ACCTAACCTG TGCGCCTGAC ATCAACATCA CCTCTGAACT 13140 

AAACAACACG GCAACTTCAC TTACTCGCAG ATCCATGCGC 13200 

ACACCTCCAT AGTGTCTCCT AATTGGAACC ATTTGCTCAA 13260 

TCAACCCACC AGATTTTAAA TGCTTGTGAG GATAAAATCG 13320 

C C ACT ATTTT CTTTACCGTA AAACCGTAGC GCAAGAGCTG 13380 

CCCATATTGT AAAATGATCG CAAGGACTCT GCGCAAAAAA 13440 

CAGTGACACC TGCAAAATTA GGGGTAGAAA AGGCAAGAAT 13500 

CTGCCACCTT CCTGAGTACC GCCTCCAGGT CCTGAAAATG 13 560 

TAACGGCGGA GAACGTACAC TCTCGAATGT AAACAGATAC 13 620 

CAAAGCTGTG CCGAATCACA AAGTCAAAAC ACTCAGGAAG 13680 

AAGCAGGAAT GCAGAGTGTG TCTCGCACAT GACGCACCGC 13740 

CAACAGCATT CCACCCCGCA GCCTTTGCCG CAGACAAGAA 13800 

CATCTAAAAC CTTTTTATCA ACTGCAAACG AATTACCTTC 138 60 

C AT AT AAGC G TTCGATCTCC TCCATCCTGC GCGCACCATG 13920 

CCTCAAGGTA AGTCTTGCCA TACTGTGCCT TGTATTCTTC 13980 

ATCGAACTGG ATCTGAAATT ACAAAGGAGA GAAAAATCAT 14040 

GAAAGGTTTT ATGTAATGCA CGACCGACAA CATCCACCGC 14100 

GACAGCAGTG TTTTTTCCCC CGCGCAACGc gCATGaTTTC 14160 

GCGAATGTCG CGTCACAACA TCGGGTATAA TAATTCCCTG 14220 
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CTGCATCTGA GACCACAACT 
CCACTGcACT CGAGAGTAAA 
CTGCAGCTGC AGCCTCAAAT 
GGAGATGCTC TTTCAAATGA 
CAGACGCTTC ACCAGGAACT 
CACACGAACG CGCGCGGCCG 
CGCCAGGTAT AGGAAAAAAT 
CTGGAAGTGG GATAAACGCA 
CCGGACTTTG CAAGACGGGA 
CCCCCTCATC CAGAGCAAGC 
AAGTACGGAA GTTGTCGACC 
TTCCTCAGGA AAGGGAGTAG 
ACAGCGCACA CGTCCTTGCA 
TCCCTGTCCT ATTTTCACTG 
ATCTAAAATA ACATCGGAAG 
CATATGCTGC GCTCGCTGAA 
ATACCATGCG GCAGCGGcAG 
GGCAGGTCCT ACATGCTCAC 
CGAGCGCGCC TTTAATATTT 
ATCTGGCTCG TCCAGTTCCG 
AAAAGGGTTA TCTCCGGTAA 
CTTAACTGCG ATACAAAAGC 
CCCATGCGCG CGCGCAACGG 
ATACGTTTCT GCAGGGATGA 
CATAAGGGGC AACAGCGCCT 
AACTGCAACA CCGGAACGAT 
CGCGGCAGTT TTTTTAAAAC 
GAGACGGTAC GCCTCTATTA 
GCCCGAAATA AAGGCCCTAT 



539 

CATGCGCAgc AGGnATCAGC AGATCGAATA ACAGAAAAAC 14280 

AAATGATAGG GCGTTGGTGA AACAAGCAAC ACCGCCGCCC 14340 

GCGGTAAAAC CAAAATGGGT AACTACCACG TCCCAGCGGT 14400 

GGCAAAGAAG GGTATATGTG CACCTTACCG TCCGTCCGTT 144 60 

ACCACTGAGG TGTCAAAACC TAATGCAGCT ATCCGCTCAG 14520 

TGTGTATCTT CTGCCCCATA CACCACCAAT ACGGTGGTGA 14580 

CTGCCGTTTG CCACAGGCAG ATCCTGCTCC TTTCTCGTCT 14 640 

CTATCACGTA CATTCGTAAG AGCGGCCAGC GACCTTCGAC 14700 

AATACATCTA TCAAGTAATC CGCATTCAGA CGTCCAGATC 14760 

AC TGGTGC AG TACGCTGAAG TAATTCAATC TCGCACGTAG 14820 

ACTATCAGTG ATGCATCCGC AGAACCGTGT CCACTATCAG 14 880 

ACAGTTTGCA CAGCAAACTA CGATCGGGCA CATACAAGCA 14940 

GTCGCAAGAC TAAATACGCC GCCCGATATA AATGCCCCGC 15000 

AGGGTACAAA TACCACCACC TGCCGCGCGT ACGCATACGC 15060 

AGAGCGGAAA CGGACACTGC AACGTGCACA CGTACTCCAT 15120 

AGTCTTCCTG CGTGTCCACC GTTACTCGCA CATCAGGGTG 15180 

GTTCACG CAC ACACACGAAA ATACCCGGGC GGCGATGTAA 15240 

GGTCGTACGC CTCCAAAGGA AGACGATCTG CTAAAAGCAA 15300 

CCACTCCGCT GCCGTAGGGA AGACCAGTGA AGGTAAAATA 153 60 

CATAACGCAG GAGCGCTGCA GCAGCTGCTT CGTGAAACAA 15420 

CCCGCACGAC GGTTCGAATT GGGAACGAAT GCTCAAAAGC 15480 

GGTGGAGCAC ATCTTCTGCC GATCCTGAAA TACAGTAGAA 15540 

GTTCAAAATC TTTTTTAGAA TGTTCATCAC ACGCAAGAAT 15600 

CCCGCGTTGC CTGCAACACG TAATGTATCA GCGGCTCCCC 15660 

TTCCTGGTAA CCGCGTAGAA TCAACGCGTG CTTGCACAAT 15720 

CCTGCTCCAT TAAACgTGGT TCCCACTCAC TTACTAAACG 15780 

GGTATCTATT CACCGCCACC CATTTCTGAG CGTCCCGAAA 15840 

GATGCAGACC AGAATTTCTA AAAAATGAAA AAAACTGCGT 15900 

CCTTCTCTAG GAGTGGCGCA AGGTTTTTCA GATAAAAGAT 15960 
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GCGGTACGAA CCATCTGCAC TTGCATCCTC TTGCGGGGGA AGAGCCTCAT 
TATTTGCGTA TTAACCTCTA TGCACTCCCC CCACAAATAA GAACGCAATC 
ATTTTGCCAA TATGCATTCG CGATGGTAAA ATCAAAACCT CCGCATTGGA 
CCGATGGTAT ATTCCAACAA AATCATACGG GTATATCGTA GGCGTATGGT 
TTCCGTAGGC TG TGTAAAG A AGTCACTTTT TCGCAAGGCA GGGACAATTT 
TACAGTACTG TGGGAAGAAC ACAATTGCGG CGCAATACAC ATGTGCGTAT 
GATATCCTGT ATGTGCTGGC TCATACCTGC GGATCTATGC GCATGTCGCT 
AAAACATACA TGACATCAAG TTCTGCAACA CCTAAATTGA TCATTTCTCC 
CACTCAAGCg GTGTGATAAA CTTAACGAAG GGGAAACgCT CAGCAAGACC 
GGAGCTGCTG CACTCCGCTC CACCGATACA ATGGGCGCGA TGCCAAGGGT 
GCAAAAAACT CTGCCCGATT CCACCGCACT CCCCGATTGA GTACTACTGC 
GCAGACGCCA CCGGACAGAG TTGACCACCC ACCACCGTGT GTGCAAmATT 
AAAATTATAG GTATAGTACT CATCACACTG CCCACATAAT CCTTCATGTA 
GTGCTGAAGA TACACATCCT GTCCCCGCCG CCAAATCTGT GCAAGATCCT 
ATTGCCCAGC GCG TGGCGAC AGTGCACATC TTCTTTGCAC AGCGGAACTC 
GAAAATAATC ATATCTCGTT TTAGATGCCA ACACGGATAC CTCTCCAAAG 
TGCAACCCGC CGATCGGGAA GCAATCCGCA CACATGATCA TACTTTTGAA 
CCCCACACGC TCCTTCCACG TGCGGTAAAA GGGCTCGAGC TCTTTTTCGT 
ACGGAAAATC TGTGGCCACA GCACGCCCGG ACATTGCGCA TGTACCTGCA 
CGTCGCTTCT TTTAGAAAAA ATTCCGCTTC TGACAGCGAC ACACGGTGTA 
CATGCCTGAA CTCACCGCAT CTAAAAACAC AATCCAGCCA ATGGCAAAAG 
ACTGTTGCGC GCACAtTTCG CACAAATCAC GCACTACAGA CTCCTGcCAC 
TTGTCTCAAT GAGCACCGAC AAAC CCGGAT ACTTGAGAAT CTCACGTACG 
GTGCAGGATA CAACAC CGG A TCCCCAAATA CCGAAAGCGA AATGACCGCA 
AGTCTGCAAT GCGCCGGATC AACGCACACG CTTCCTCTTT TGGCATCAGT 
CCACCTGCGC AGGAAAAGAT. ACTGGCCGAT ACAACGAAGA AAGCGGrTAC 
ACTCAAGCGC ATAGTACGCA GGAACTGTGC GCAACGCATG CTCACGTGCG 
GTGCGTgAAT TTTCTGCAGT GATATCGGTA AATGCAGCAC ACTGCAAGAA 
GAACTCGTAT AAAATTCCAG ACGCAGATGC CGCACATCAA CCGGCGCAAT 



PCT^ PK/1 3041 

AAAACAGTCT 16020 

CAAAATCAAG 16080 

TAAACCGGTC 16140 

TAGTCGTACA 16200 

GCGTTGGGAG 162 60 

TCGTGCGCAA 16320 

CCACAGAACT 16380 

AACCGAAATA 16440 

GGTAACATCT 16500 

AGTCAGTTGC 16560 

ACCTAAAAAA 16620 

TTTCTCGTTA 16680 

CTGCACGAAC 16740 

GCTGGAAAGC 16800 

GACCATCAGT 16860 

GAGATAGATC 16920 

CAATCACCTG 16980 

TTTCATTCAT 17040 

TTGCAAATTC 17100 

CCTGACTGTA 17160 

GAGTGCGCGC 17220 

CCCAACCCAC 17280 

ACGTCACACA 17340 

CGTTCTGAAA 17400 

GAAGCATTCT 174 60 

GCACGCGTCA 1752 0 

CTGATAAGCT 17580 

CTGCGCCTTA 17640 

CATAGTTTCT 17700 
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AGATCAAAAG AATTAATGTC TGTTTTAATG CTCTCAAAAA TGAACGAATG wCaAAACAAA 
TATGTGCATC CTGAGTCAAA GTAGCAAGGA TGGGGAAAAG TCCTGGCGCA ACAACTGCAG 
CAAATAACCC CTCCGGGTAG CCATCTGCAA AACTATACTC TGCAAGATAT TCGCGATGCT 
GCGCGTATAA CTGGGCGCTC GCCACACTAT CAATAAAGGG CGCGTCTGCG gCAGaACAAA 
GACAGCCTCA GGTTCTTCAC CCAATTGCGC GTATTCCGCA CATACGCGTG CAACATGCGC 
GAAGAAGGCG CTCACTCGCA TGTCATC C AG AACGTTCACG CGCAGaCGGg AAAAATAGGc 
GTACGCAcTG CATAAAACGC GCAACCTTCG cGGCGCTCGC gCATcCGCGT ACACACACAC 
CTGATGGCAA CCAGGCAACG CGTAAGCAGc TGTAACAGCA CGCTCGAAAG CGCAACGCCG 
CCCCACCCCG GCTGTACTCT CACAGCCCCT CACCTCCACG CACGGCACGA ACCCCGGCAC 
TGCTTTCATA CACGACTGct CACGCGTGCC ACACAAGGGC ACAAAGGCAT AATCGCTCAG 
ATCAAAGGCA CAGACCACAG CAACCGTTCC CACCGCCCCT TTATCGGCAC GCATGcAGGA 
GAACCTTGCT CAAAACTTTT CTGAACTTTA AAAGCACAGC CCCGAAGATC ACTCAACCGA 
GTGAGAAAAA GAATCATCAC ATACTCAGCT CATGCAATTC ACGCCTGCGT AAACTTCTCT 
TGCAAAAGGC GGGCAACTAC ACGGGGGACA AAAGTAGACA CATCACCACC GAAAGAAGCA 
ACCTCGcGTA CCATGcTGGa ACGAAGCGcA GCATaGCaGG GcTTTGCCGc CAAAAAAACT 
GTTTCTAAAC CAGCGTCGAG CGCACGATGA ACCCATGCAA GATCAAACTC CTGACAGAAA 
TCAGTAGCAT TTCTCACACC GCGAACCAGC ACACGCGCAC CAACATCTCG AGCGTACGTA 
ACCACAAGCG AACGCCAAGG AAAGACGTAC ACACCCGGAC GATCCCCAAG GACTTGCCGC 
ATCAAATCAA CGCGCTCACA TTCTGAAAGC AAATACC TTT TCTGAACATT GACCGCAACC 
AACACGTGGA CCTCTGcAAA AAGACTACGC GCGCGCAGAA CAAGATCTAA ATGCCCAAAG 
GTAGGCGGAT CAAAAGAACC GGCGAAAATC GCCTTCACGC ACGGCAACCC CTCACGTTGT 
TCAGGAACAT GCGGCGAACT ACACAAAAAA CAGGCAGCAC TATTTATATT CGCACCCACT 
CCGTCAACTC CTCGCGGAAA ATCGGTGTTC ACGCGAGCTC TCTACGCTTC CTAGCTACAC 
AACGTGCGCA CCGCAsGnCG CGAACGCTTC TCCTCCTACT GTGCTCTTCC CCCACCCACT 
GAGATCGCAA TAGCACGCGA CTTCCCGAGC GCCGCCCATC CATTGAGCGG AAAGCGCTCA 
TACAGCTGCA TGTAGGCAGC TGTAGCAGCA GCCGCGCGCC CTAACGCCTC TTCCATGCGA 
CCAACGTTAA AGAGAGCACG GGGAACGAGT GGAAAATCCT GCACGCGTGC ACTCCTCTGA 
TACAGCTCAC GCGCCTCCTC GAAGCGACCA CGCTCATCCG CGCAAGACGC TGCATTAAAA 
TAATACACGC CGGCCACGTA GCTCCTACGA GCACCATACG CTGCACGGAC ATAGGCCTGC 
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TGTGCCTTCT CCCACTCCTT ACGCGCAAAG AAAATGTCCG CGACACAGGC CTGTGCGTAC 19500 

GCATATGCAA AGCCGTCCCG CCACGGCGAA CTGGCGCACG ACTCAAGGCG CGCCAAAAGC 19560 

GCATCCTCCT TCGCACGTAT GGAAGACCCT CCCCGCCCCT CCACTTCATG AC C AGAAGCG 19620 

CTCACTGTAT TATCCGGTTT ACGCAATACG TCCCACTcAC GCGCTATGsk CgTnACCTCT 19680 

GCTGAAGCAC GCGCGCGTAA ACGCGTCATA ACCAGCAAAC ACCCTGCACT TAGCCCTAGC 19740 

CCCCCGAGGA TAGCGACGAG CACACCCACA AGCAACCGCC GGTGCAGTTC CAAGAACCGA 19800 

TCAACCCGCA CTATCCCACG CCGCTGCTCA TGCATGGATC CCTCCTCCCC TCACAGAATC 19860 

ACTCAACTCG AGAACCCTCA CCGACAGgcG cGAGCaGCGa AnmCCCACAC CTG cCCAAGG 19920 

GAAAGAACAG CACACCGGaA CTaCCGcAAG ATACACACGG aGGCTCGGGc ACCTaCCTcA 19980 

CACGcAAAAG ACCTTGCGGA GAAGCACCCA CAACAACCGA GGAGGAGCCA CCAAAACCGC 20040 

AGTCAACCCA CTGCGCCCGA AAGAACCGCG TATCACACGC GCAACACACC GACCCACTCA 20100 

CTTTCCAGAC GTTTGCCTCA TCAAATCACC GAGCGTAAAC GAGCCTTCGT CCTCCCCCCG 2 0160 

CGGGGCGGaC ATATACCGAG AAAGCTCGTC ACGCTGTACC TTTCTTTGAT AGTCTCTAAC 20220 

AGAAAAAGCA AC C TTCCTGT CCTTCACGTT CATATCTACG ATCACTGCCT TGACCCGGTC 20280 

CCcCACTGCG TATTTCCTTA GCGCTTCACC CGGATCCCCA TCCCGATTCT CAACCAGATG 2 0340 

CTGCTTGCGA ACAAGCCCCT CAACGCCACC GGGAACACGC ACGAAAATCC CAAAATCCGT 20400 

CACGGAAGAT ACTTCCCCCT CCACGGTAGA CCCTACCCCA TAGGCGTTCG CAAACACCTG 20460 

CCACGGATTG TCGCTCAACT GCTTAACACC AAGCCGAATA CGGCGCGCTT GCGGATCACA 20520 

CTCGATAACC ATACACTCGA TTTCTTTACC TACCTCAAGC TCATGGTCTG CAGGACGCGT 20580 

CCGCTTAACC CAGGACAGAT CATCGACGTG CAAAAAGCCG TCTATTCCCT CTTCCATTTC 20640 

AATGAAAGCA CCTGCGTTCG TAACCTTTAC GATACsGsGC GTAAAGCGcG CACCCACAGG 20700 

ATAACGAGCC TCTATTTCCT CCCAAGGATT CGCCGTTACC TGCTTAAGCC CCAGAGACAC 20760 

CCGTCCCGCC TGGATATCAT ACCCGAGGAT CATACACTCC ACTTCATCCC CAATTTTAAC 2 0820 

CATGTCACTG GGTTTACTCG TTTTCTTTAC CCAGCTGAAC TCACTAATAT GCGCAAgCCC 20880 

CTCGATACCC TCAGCAAGTT CAATGAACGC ACCGAAATCA GCGATTTTCG TTACACGCCC 20940 

CTTGACCACA TCATTCACGC CGAAC TTGTT TTCAAACTCA AGCCACGGAT CCGGCTGAAA 21000 

ATGCTTCAGG GACAAATTGA TACGCTTcTC CGCCTGATCC AGGCGGATAA CCTTCAACTC 21060 

AATGGTTTGT CCTTTCTTCA CAAACTcGcG CGGCCGCGCC ACGTGCCCCC AGCTCATGTC 21120 

ATTCACATGC AGGAGGCCAT CGAAACCGCC CAAGTCAATG AAAGCACCAA AACTCGTAAA 21180 
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GCTCTTAACC ACTCCGGATA CGGAATCTTC AATATGAACC GAATTGAAGA 
CGCCTGCCgC GCACGCTCCT CCAAATACCG GCGTCGATTA ATGACAATGT 
GCGATGCTGT TTGCTTTGGG ATATACGCTC GATATAGAAC TTAGACGTAA 
ACTCTCAGGC GCGTCGACTT TCTGACAGTC CGACTGGCTG ATAGGTAAAA 
CCCCGCACCC AAGTCCACTT CAAAACCACT CTTCTTTTCC GTTAGACGGA 
CTCAACCGGA GTCCCGTCTC GCTCCGCATC ACGTAACTTA ACTTTCAAAC 
GGCCTTCGTC TTGGAAAGCT CAGGGCCATA AGGCGTCACG CGCTCCACAT 
GCCATCCCCT GcCTTCGGCG GcGCCTCAAA CTCTTCCACT GGAACGCGCC 
TCCCCCGATG TCTACAAACA CCGTCCCCGC ATTAACCTGa ACCACCGTCC 
AGAACCAGGT TCCGGAGCCT CAAACGAATA CCGCTCCTGc AGCTGcCGCG 
TGT Ac CCTTC CCCTcCTGaT TTTCCACTGa ACGCTCTCCT CCCCACAAAG 
GCCTCGCGCG CGATTCTTTC ACAAACCTcC TcAATGGTCA AGCAAGAAGT 
GCGGCATCAG GGGCACAACT GAGCCCCCCC AAGGTGnGnG CCCTGTnG 
(2) INFORMATION FOR SEQ ID NO: 64: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13518 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : doubl e 

(D) TOPOLOGY: linear 



PCX 

ACTCCTcGCG 
TGTCGTTGcC 
GCCCAATGAG 
AgGCCATCAT 
CGATCCTCCC 
CCAAGCGATC 
ACACCCGAAC 
CTTCAGATTT 
CCATCCTAAC 
GcACCAATGG 
CTyTGCGGTG 
ATCCAGTACA 



3041 

21240 
21300 
21360 
21420 
21480 
21540 
21600 
21660 
21720 
21780 
21840 
21900 
21948 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 

AGTGTTCGCC CGACGGGAAA CGTATGGGGT CGGTACATCG AATGTTACCC CGCGCATGCA 60 

CTAGAGAGCA ACACAACGCC CCGGGGAATG CGCATAGGAG CAGCAAGAGA AGCGGCGCTG 120 

TACGAAGGGG AGTACGCATC CTGGTAAGGA ACAAATACGG TATAGCCACG TTCGGCAAGC 180 

ATCAGTTCCA GCGTTTCAGG GTGTGCCCCG GTGCGCAGAC TGCGGGGTAT TTTTTAGAAC 240 

AACCACGTTC ACGCCGACTG GGTTTTGTTT CGGGTGCGCG CGCGACTGCG GkTTTTCGGA 300 

AGGkTkTGGC GCAGACGGkT TATCCAGATG AGATGTTTCG TAGGGATTGG CCAGGGAGAG 360 

CAGGAsCCcT GCGTTCACCT TCGTGTTACy TAsCCGAAGG GAAAAGGAAC TGCGCAyTAC 420 

GCTACGGCTG GGCCGATAAC CGGGCTCCGG GGCACAGAGG CAGACAATCA CAAGCGTGAA 4 80 

AACACCTGCA ACGAGGAGCA GGACCCGCGC AATACGTGCT CCTGCACTGT ACAGGTCATT 540 

GGGAATTCCC TGGCTGAAGG CGACAAGGCG CGGCAACTCC GTGAGACACA CCACTGAACT 600 
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TGAGACTAGA CTCAAGGTAA GGTCCAACGT CAGACCCTGT CCGAGGACTG TCAGTGCACC 6 60 

GACACTCAGT GCAAGCACTG GCAGCAGGGG GATAGCGCTT ATCTTTCTGC TTTGTCGGCT 720 

GAAGGGGCGA AGAAGCGCCG GCAGACTCAA ACCGAGGATG AGTAGCTCGC TGACAAGCAA 780 

ATCTGACACG GCCTCCGTCT CGGTTGACTC GGTTGTACGC TTTACCAAGT TTTTCTGAGT 840 

GAGmaCCACC TGTTTCCTTG TTGCGCAAGG GAACAGGTGG TGGTAGGTTT GCGCGGTGGG 900 

TACCTTGTAC GTGGTAGCAA CGCCGATTGG AAACTTGGCA GACATCACCC TCCGTGCCTT 960 

AGATGTATTG CGAACGGTGG ATGTAGTTGC CTGTGAAGAC ACGCGTAGGA CGCGTGCGCT 1020 

CCTGTCTCAT TTTGGGATCC ATAAGCGTCT TGTTTCCTGT CGTGCACACA ATGAGGCGCA 1080 

GGCGGCGCGT CGACTCATCC ATTTTTTGAG CACCCCTATT TCTGCTTTTC TCTCTCCAGA 1140 

GAAGGGGAGG GGCAGGCAGA GCGCGCGGCG CACGCGTGCA CGTCCGGGTG AGACGGTAGG 1200 

GACAGCTGCG CTGCAgCTcG CTGCAGAAGC AACGGGGGAA CAGGAAGTGT GTGGATCGCC 12 60 

GCACGCACAG GTAGCCTATG TTAGCGATGC AGGTACGCCG GGGGTCAGTG ATCCGGGAGC 1320 

GGTTTTAGTG CGCG CGGTGC GGGATGCTGG GCACACGGTG GTACCGATTC CCGGTGCTTC 1380 

TGCACTGACT ACTTTGCTGA GTGTTGCAGG CGTGCGAGAC AAGACCGTGC TATTCGAGGG 14 40 

GTTCCTTTCA CCTCACCCGG GTCGTAGGCG TGCGCGCCTG GTGCAATTGT GCGCGCAgcg 1500 

TGTaGCTTTT GTTCTGTACG AGAGTCCCTA CCGGGTTCAA AAGCTTCTAG AGGATCTGGT 1560 

GGCGGTGGCG CCGGAGTCGC AGGTGGTGCT GGGTCGGGAA TTGACCAAGG TGCATGAGGA 1620 

GCTCTGTGTG GGGACTGCCT TGCGCGTCAT GGAGAGCTTC TGtGCcGGAC GCGytGCGGG 1680 

GGGAATGCGT GTTGCTGGTT TC TGC AG AAA AATTTTAGAT CTTTATTTTT CTTACAAATT 1740 

TCCGATAATG GGGCGGGGGT GGGGCTCTTn TGATGATCGA TAAGCTAATn GACTTGATCC 1800 

GGTTCAGAAC CTTCGCGCCT CTTGTGCGTC TGAGCATGTG GCGCGTGCTC CAGCCGGCGA 1860 

TGAGATTACT GTCTCTGCGG AAgCCCAGAA AAAGGCTGAG TTGTACTTGG CCCTGGAGGC 1920 

GGTACGTTCT GCGCCTGATG TGCGTGAATA CAAAATAGCA GCTGCGGAgC agAaGCTTGC 1980 

AGACCnTGCG TATCTGGAGC GGGCGCTGTc CCACGTGGTG GAGCGCTTcC TGGAGGAGCA 2040 

GAATTTATAA GcCTGTAGGC AGGCTTTTTA GGTCCGGGTG AGGGCGTACG GGCTGTTGTG 2100 

TTTATACCCT CAGGCGGACG CTCTCGATGT CTGGGCTGAA CAGTTCTCGC ACGTCTGAGA 2160 

TACCGAGCGC CAGGAGCGCC ATGCGGTCTA CTCCTAGTCC CCAGGCCATG ACGGGGACGT 2220 

GCACGCCGAG CGGGTCGGTC ACTTCTGGGC GCAGGAGACC TGCTCCTCCC AGTTCGAACC 2280 

AGCCGAGTGC GGGGTGGAGT GCGTGTAGCT CGATAGAGGG CTCCGTGAAC GGAAAGTACC 2340 
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CGCCCACGTA GCGTACCTCC TGTGCACCGG 
CTAGAAGGGT ACAGACGTTC ACATCCGTAC 
CTGCAAGGTG CGTCGCATCC ACTTGGTCGT 
TACCCGGTAT GTGGGCGGTT GGCAGGTGGC 
ATAGCAGTCG GCGGGTAAAA TCCCGATCGA 
CTGctCCGCG TTCATGCGTC GCCGCAACGC 
CGTGCGTTGG GTGTTTGAGG TAATACACAT 
GCATGAACAG CGCGTCCGCG TTCCAGAAGT 
GAAAGCCAAG TGCCACCAGA CGATCTTTGA 
ATCGGCCGGG GATGATGCGG GCAGGGGGAA 
TCTTCCACGC GCCACTTTTT AGGCACTCGA 
GCCcTGCGGT ATGCAGCGCC TCCTGCACGG 
CCCTCTCGCG TACACTGACT TTGAAGAGGC 
GTTCCATTAC ACGTCGCTCA TCATCAGAGA 
TGTCCGAGGC CTCTGACGGG GAAGCGACGC 
TGAGGGACAT GCGAaTCACT TACGTGCGGT 
ATACGGAGGA TACCCTCCTG CGCTAGGATA 
AGCGTGAsCG CATGGGCAAG CTCAGGGAGA 
TGTAGGTGCT CGGC TGCATC TGCAATAGCG 
AGCATACGCT CCTCTGCAGT ACCGTCGCTA 
AAGGACCGCA TCTG TTCCCg CTGGTGCTCT 
AACGCTTGGT TTGCGTGTCC TTCCTTAAAG 
AGGATCTCAT CCATTGCGCA GTTCTTGAGG 
ACAAGCGTGT TCAGATCGGC TTTACCTGTC 
GTTTAGATGT AGGCTGTTTC TCAATTTTTC 
TGGAGAAATA GATGGAACGA TCCTGGGAGG 
TGGGTTGCAC CATGGTTGCT CCTGGGATGC 
CGTCTGCGTG GGTGGGTGTG GCGCTTATGT 
GCACGCGGTC GGGTGGGCAG GCTCAGGAAC 



545 

CGATTTCTGT TGCGAGGATT 
CCAGTACGAT TCCTTCTGTT 
GAC GGAAGC A GCGTGCGATC 
GCGCTGAAAG TGCTGTTCCT 
ACGAGTAGCG CCAGCCGAGG 
GGGAGAGGAA CGGCTCTGGG 
CATGAATGTC CCGTGCAGGA 
CTGTTTCCAC CAGTGGCCCG 
TGTGTTCGAG G AAATC TGCG 
TGTGAACGTT GTAGCGGCGT 
CGGTGAGTGC ACCTATCTCG 
CACGGGCGGT GGGGGTAAAG 
TGTCGctTGC GCCCCGCTTT 
GCTCAGATTC AAAGAGGGTG 
GTGCAGCGGC GCGCTGAAGC 
GAAACGATGT GTATGCGTTT 
CCGAACGCTG AACCTACATC 
CTGAGCCCGT TGCAAAGTGG 
GTAAGGGAAG GGGGGGAAGA 
GCGGCAGCAT AGCCGCAGGG 
TCGATGATTC GCTTTGCCCG 
CCTAGCCGGG AGATGAGCAA 
ACTTTGATCT CAAGGGGATG 
ATGCGCGGCA TCATAACGTA 
GTTCTGCATC AGGGTACATC 
GTATTGACAG AGGTGTGCCC 
CCGCAGCTTT TGCTCGCTCG 
GCGTTGCGTG GTCGGTCTCC 
GCTTAAGTTC CTGGCGCCAG 
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TTGAGCATGC 2400 

TGGTAGAAAT 2460 

CCAAAATACT 2520 

TGGCTGCGTA 2580 

CTCCCGCTAT 2640 

ATTGTAGGTG 2700 

TGGAACTGTG 2760 

TCAAATTCCT 2820 

TAGGCATTAG 2880 

AGATGTTGTG 2940 

TTTCCGGTAA 3000 

GTGAACGTTA 3 060 

TTTGCTATGC 3120 

CCTGGAGGAG 3180 

AAGGTGCGCG 3240 

TTCACCGTCC 3300 

CTTTGGTGCG 33 60 

GGGGCGAGGG 3420 

CAAGAAGGTG 3480 

GGTGAGTTCA 3 54 0 

AAGCCAGGAA 3600 

CGAAGTCGAA 3660 

CAGCTTGTGC 3720 

TTTTCGCGCC 3780 

TGCCGTAGTG 3840 

GTCTATGGTA 3900 

GTGGTTCGCG 3960 

GCCGCGGAGG 4020 

GTTGTGCAGC 4080 
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GCATGGAGGT ACATCTACGT GCGGCGTACA CCTTTTTTGA GAGTGGGGAT AGTGATCGCG 4140 

CCTATGAGCA GATAGATAAG GCGTACTTTC GCTACTATGA GGCGAAGGGC ATGGAGAAGA 4200 

TCACCATGGG GTATCTGTCC GGTGCGCGTA AGGCGGCGGT GGAGAACGCG TTTTTCGCGT 42 60 

ATCGGCGTTC GGTGCGGGGT GCGCGTGATT TGGCGGGCGT TGCCTTCTGC AGGGACAAGC 4320 

TGGTTACCAT GTTGTATGAG GACGCGCGTG CGCTGGATGG GGTTGCGCGT GGTCGGGCGG 43 80 

GCTTTGCGGC GCATATCGCC ACGTTTGTTG CCTCGTGCGT GTTGGTGCTG CGCGAGGGAA 4440 

TTGAGGCAAT TTTGGTTATC GCAGCGATTG TTGCGTATCT GGTGAAGACT GGTAAGGAGC 4500 

GGTGCTGCGC TGCGGTGTAT GCGGGAGCGG GCGCGGGTGT TCTGTTCAGT GTCGTGCTTG 4560 

CGGTGATGAT AGTCCGGGTG TTGGGTTCGG AAGGTGGTGC GGCGCAGGAG ATTATCGAGG 4 620 

GTGTTGGTAT GTTCTTCGCA GCGGCGATGC TCTTTTACGT GAGTAACTGG ATGTTGTCCA 4 680 

AGGCGAGGGC ATGTGCTTGG GATCGCTATA TCCGTCAGAA AGTTGAGCGG TCGGTGTCTC 4740 

GGGGTAATCA GTGGGCGCTC GTGGCCACTG CCTTCCTCGC AGTGGCGCGG GAAGGGGCGG 4800 

AGCTTATTCT TTTC TTTCG A GGCATCCCAG TTGCGGGGCC ATATGGGCGG CTGGCTGTGT 4 860 

GGGCAGCGGT TACTGTTTCT GCCTTGGTTC TGGTGGGTGT GTTCGTGGCG ATCCGTTTTC 4920 

TGTCAGTGCG ACTTCCGTTG AGGCCTTTTT TTGTTGCCAC GGGCGCGGTG ATGTACTTGC 4980 

TATGTTTCTC TTTCGTGGGT AAGGGTGTCA GCGAGCTGCA GGAGGCAGGT GTGGTCAGTC 5040 

GAAGTACGGC ACCGTGGATG CATGGGTGGA GTTTTGATTT TCTGGGCATC TACCCGACCT 5100 

ATGAGGGTCT GGCCCCTCAA GCGTTTGTGG TGGCGTTGGT GGTGC TTTCG GCGGTATGGT 5160 

GGTGTGGTGG TCTCTGCCGT GGCGCATCCA GCACGTAGGC TTGGGACGGC TGTGTCGCGT 5220 

CCTACTGGGG CCGGGTGTGT GCTGCGCCGT GGAGATTTCC ATTTGTTTTT CTATAATGGT 5280 

GAGGAAAAGA AGCGCTGGAC GGGAGAAGGC GTTTTGAAAA GGAGGGGCGC GTGACGCCCC 5340 

AGGGGAGTGA AGAATGAAGA GGGTGAGTTT GCTCGGGAGC GC AC CATTTT TGCGTTGGTT 5400 

TTTTCCGCGT GCGGGGcGGT GGAGAGCATC AGCACGGTGA GGAGATGATG GCCGCCGTTC 5460 

CTGCTCCAGA TGCAGAGGGG GCGGCCGGTT TTGATGAGTT TCCTATAGGC GAGGATCGGG 5520 

ATGTGGGGCC CTTGCATGTG GGArGGGTGT ATTTTCAGCC GGTTGAGATG CATCCGGCTC 5580 

CAGGAGCACA GCCGTCGAAG GAAGAGGCGG ACTGTCACAT AGAAGCGGAT ATCCACGCAA 5640 

ATGAGGCGGG TAAAGATTTA GGGTATGGAG TCGGGGATTT TGTGCCGTAT CTC CGAGTTG 5700 

TTGCTTTCCT CCAGAAGCAT GGCTCTGAGA AGGTGCAAAA GGTGATGTTT GCGCCCATGA 5760 

ACGCAGGGaC GGTCCGCATT ATGGGGCGAA CGTGAAGTTT GAAGAGGGGC TTGGTACGTA 5820 
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CAAGGTACGT TTCGAGATCG CTGCACCCTC GCATGATGAG TACTCGCTAC ATATTGATGA 5880 

GCAAACTGGG GTTTCCGGAA GGTTCTGGAG CGAGCCATTA GTTGCAGAGT GGGATGATTT 5940 

TGAATGGAAG GGGCCTCAGT GGTAGGGACG TTCAGAAGGT CCGAGGGTGC GCGCGCATAA 6000 

GGGCGTTCTT TGTTCAGTAA GACAGGCGGG TAGTGCAGTG CGTGGCGCTG CTCGCCGGGT 6060 

CCGTTTTGAG GGTGTGGGTT TTGACACGCA gTTATTTTTT TGAAAGTTCT CCTGCGCGTT 6120 

CTTCTGTCTC CGTGGGGTTG TGCGGTGTAC AGAACGGGGG GGGGTGTCGT GAGTGCGGGT 6180 

ATGAAAGTCT TGGTGTACGC GGTGGCGCTG GGGTTCGGGT GCGGGGGTGT GGTGCACATG 6240 

CGGGAGGGGG ACACCTACCA ACAACTCCTC GAGCACCGCA TTGCAAATGG TCGGGAGTTT 6300 

TCGCGGGTGT TTGCgCAGGC ACAGGTTGAC GAAGCTGAGC ACAATGAAGT TCGGACAAAG 63 60 

ACGGCGGGAA GTGTGCAAAT TGGCACGGGA GACGTGCTCT TCAACAAGAA GAATGGCAAT 6420 

GGTGCTAACG GCTACAAGGT GGAGATGGCG CCGCATTTGA GTATTGCGTC CCCCTTTATA 6480 

GGAAATTCTC GGCTGAATCT TGTTGCCCCC CGCAAGCTTG ACGGTGTCAC AAGTACCTCC 654 0 

ACCGTGTCGG TGGATTACAC TACCGATTTT TACTCCTCCG TTCGTCCAAC ATACCTGAAC 6600 

TCCCTCAAGG AAAAGACATA TCAGAAGGAG AAGAGCGGTT CGGCGCTGCG TGATGGGCGC 6660 

AGGCTAGTGG AACGGGAGTT TTTGCAGGAA GTACAGCGTC TGTACGGTAG TTACGCGGAC 6720 

CAGGTGCGCG CAAGTTTGGA GTTGGTGCGC GCGCGGTTGC GTTTTGAGTC AGTAAAGAGA 67 80 

CAGGGATATC AAGAGGATTC GGCGTATTTT CAGAGCGCAC AG CTTGC AC A GGTGCGGGCG 6840 

GAACGCGCCC GGGCACAGGC CAGGCAGCGC TTTGACCTTG AGTACACGCG GTTTGCAGCG 6900 

CGCAACGGGG TGGCCTACGA GGACGATGAG CGCGACGGTT TTTTACACGA TTTGGCGGTT 6960 

GCAgTGCCGC TTGAGCCGGC GATGGCGGTG ACTCAGTGcG nCAGGGGAGC GGGGGCGcGA 7020 

GTATTGTGAT GCGCAGGAcC GCTGCGAGCG CGTCATTGCC CAGCGAGGTA CAGATTACTC 7080 

CCCCTTTCGC ACAAGCGCGC GCGTGTACTT TACCGATGGG GAAGAAAACA AGCAATTAAC 7140 

TAACGGCATG GCGC C AGCTG CTCCTAGCAC TACGAGCACG TATGGGGGCA CGTTCAACAT 7200 

GGCGTTTCCC GGCGGGGATT CCAGTTTTAC CGTGCAGAAT AGTAAAGGGC TGGCGGGGAT 72 60 

CCTAGcGAAT TTTGAGTGGA GCC CGATACG CACGCcTATC GCTCGCTGGA CTACACGGCA 7320 

GAGCGCGCAG AGCGTGTcTT TGACGAAGTT GAGCTTCAGG CAAAGGGTGA TCGGTCGAAT 7380 

AAGTTGTTCG GTGCTATAGA CGCGCAGGGG GACTCAGTGC TGGTgcTGCG CGGGGTGGAT 7440 

TTGCAGACAC TGGATAACGC GCGCAAGAAG GCACGGTTGC AGAAAGAACG CTTGGAACGT 7500 

GGTATCATCG GAAGGCTGGA GTACGAGGCA GCGCGTTCGG AGTATCTTCT GGCGCTTGCT 7560 



Printed from Mimosa 02/03/22 07:26:28 Page: 549 



WO 98/59034 



• 



pct; 




548 



TCTGTGGCAG AGGCAAAGGC GCGGGCGATT ATTTTTAACA CCGACCTTGC GTGTGCCTAC 
GGGGTGGGTG CGGACGCCGC CGCCGCTCAG TTGACCCAAG AGGAAATGGT GGTCTCTGAG 
AAAAAGGATG CTGAAGAAAA GAAAGAGAGG TCTTCGTGAG CGTAAgTTAT CGTGGCCCGA 
GGTGGTCTTC GTTCGTCCAC GTGTCGCAGC ATTCGTGTAG GTTCGTAGCT CCTACGTGCg 
CTGAGGGTGC TCAGGGGTGC TCTGAGTTTG GGGCGTTCCC TGTTTTTGAG GAAAGGGGAA 
TGTGCGCGGC GCGGCGTATG CGCAGGGCGG CAATTGCCGC GTGCTGTGTG. TTCGCGCGCG 
GTGCGGCGGC CAATCCGTAC CAGCAGCTAT TGCGCCACCG CCTGGAAGCG TTGCGGCCGG 
GTGCCCGCGC GCAAATAGAG TTTGATGTGG CGCACTGTGG GTATGAGAAG GCgCGtTGCG 
CTcAGCAGGT ACGTACGTGT TGGGCAGTGA GCTTGAAATC AGAGGACACT CgGCGGGGGA 
TTTTGGGCTC CCTCGCTTTG GAATAAAGCC CATTATCGGC GTGAGAAGTC CGCGCTACAA 
TAACCTGGTC GTGTCCATCG AC AC CGC AAG GtAACTAGCA TAGGGAATAT ATCCCGGATA 
AACGCGGATA TAGGGGTGGA TTTGTATTCT AACGTGCGGG GGCGCGAgcT CATTCGTATG 
CGTCGTGCAG AGCAaAAGaA AAGGCGGCGC AGAACGGTGA ACGAATTAAA TCGCCGTcGG 
TGGAGCTAGC GCTCATCGAT GAGCTGGAAG TGCTTTTTAC CCGCGCGCAG TCGCTCGTGC 
GGCGAGAGTT TCATATGGGG GATGCGCGTT TGGTGCACCT GCGCACGCGT GCGsCAGGTT 
TTTCTGAGCA CTCTGAAAAG GCCCGGCGCG TCCGTTTGGC GTACGACCGC ACACAGCGTG 
AGTTTGAACA AGAAGAGCGC CTGTTTGCGC AGGTGTG TG A TCCCTTCGCT GCCGTCTGCG 
CAGTGGGCGG AGGGGATGAA GC GC GG AG AG ACTTTTTGCT GCAGCTTGCA GAGGCGGTGC 
CGCGCGAGGT ACCGCTCTCG CTCGTTTCCT TGCATGCTAC AGATGCGCAC AGCCTTGCGG 
CGGcGCAgGA GATGGCACTG CTTGAACGCG CCGCGCAGAT tCGGAGCGTG ATTTGTACGC 
TGTGCGGTGG GCGCTGTTGT GAGCATGGGT ACACGGAAGA CTTTCATTCT GTTTAAGGGA 
GACGGAACCG AGTCGCTTGA AGGTTCCGGG ACGGTGGCGC TGCATATGCC CAGCGTGAAC 
GCGCAGGTAG AGGTAAAGGT GCCCTACGCG GAGAGGGGTA AGCATTCCCG TGACAAGGTG 
GGAGTGTACG GGAAGTCGCA GTGGAATCCG CTTGAAATTG CCTATAAGGT GTTCGAAAGA 
CGGGAGGAGC GGGCGCAAGA GCAAGAACAG GAGCAGTATT GTGAAGATTC CCTGGCGCGT 
GAAAcGcgGA aGATGGAGGG GTTAGAGGTG CAGGGCAAAC AGCTTTTTGC AGCACAAGAA 
ACCGCCTTGc GCACGCGCGA GGCgCTGCGT TTAGATCTTG CCaAGGTGGa rCgCGCCcgG 
CGCGCGGGkT AGTGGGAGGA AATCGCCTCG CGCGTGCGCG kTGTGAGTAT GCCGTGGCGC 
AgcTGCGTGC GGCGTGCGCG AAGTTGCATA TGTTGCGTTT TAATCTGGGA GTGGTACGCG 



7620 
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8100 
8160 
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CATTTGGCCT GGTGCCACAG 
GCGCCGGGCG GCCGTGCGTG 
TGGGATGGGC gGsTGCGcTC 
GCGCCCGGGC GGCCGCGCAA 
GGCTGCAGCG CAgTACGGCG 
AGTACGATAA GCAGCGGCTT 
CGTGGAACgc CGATGGGGTA 
CTTTTTATAA CCTGACCACC 
GGGGAGG AG s skGwnGAGGG 
GCATTGATTT GTAC TCGTCG 
ArGsmnTGCG TGATGCGCAA 
TGCTCGAGGA CATGCGCCGG 
CGTTTGCGCA AAAGCAGAAC 
CCATTGTGTA mCGCGCAgCA 
CGCAAGACGC GTTTGACGGA 
TAGAAAAACG TGCGGATCAG 
TGCCGCTGGT GTCCACCGAG 
TGAGGCAGCA GATAACGAGC 
TCTTGACACC CGCTTTACCC 
CGTCAAAATA ACCAGCGCCC 
GTCTCTGGAC TGGCATCCGT 
CCTGCACGAT GCGTTAGGGG 
AATCGCAGAC CTCCCACAGC 
AGAGCGGGAC ACGTACGCAG 
TATCGGCGCG CGTGGcTACG 
GGCGAAGgCG AATGTAGACG 
TTCTGGAACC CAAACATGAA 
GTAAGGTGAT CGCACGCAGG 
GCGCTCTGTG TGGGGCGCTT 



GTGGCGCCGT 
TGTTCGCCGG 
ACCGCCAGCG 
GAAACGGGAA 
GCGGCGCAGg 
GATTCCTTGG 
AAGTTTCGCA 
CATTTTGGTA 
GGAGGAGGCG 
GTGCGTCGCA 
GAAGCGCTCG 
ATGTTGGATT 
gcAGAGCGAT 
GCGCTCGAGC 
GAGTACCGGG 
GAGCGCTTTC 
CAtGCGAGGC 
GCGAGGAACG 
TAGATGAAGG 
TGGCCATAGG 
TTGAAATCCG 
CACGGGAGTA 
GTGCAGAGGA 
AAAGCGCCCG 
CGGCAGTACA 
CGCTCATTTT 
CATGGGGAGT 
ATGCTTTGCG 
GCCGCCTTGG 
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GAGCGGTCCG 
AAGCGTGCTG 
AGTTAACGCC 
CCGACnTCTA 
CTGTCCGACG 
TGCGCCTTTC 
TTACGCCCAA 
TGACGGTAAC 
ACTGGCAAAA 
GCCATGTGTT 
CCTGTGAGCC 
CCTACGTGCA 
CAGTGCAGGT 
GGGAGCGrGC 
ATTTTATCAT 
TGCTCGCGCT 
AGATACGTCC 
GGCGGTACAG 
AACCGGGGAG 
TTACACCGGT 
GTACGCGCAT 
TGCACAGAAA 
TATC CTCTGG 
CGCGCACAGG 
TTTGGACTAC 
TAACATCGAC 
CTGGTATCTT 
CGCGCCCGTG 
TGCCAGCAGT 



CGGGGTGGTT 
TGGCATCATG 
CGGCGCACCG 
CCAGCGCGTG 
GCAGACGATA 
TATCGCAGCC 
GGCnTCGGTG 
GCAGCCGAAC 
GACgCTCGAC 
TGCGGTGAAC 
GCACGTAAGT 
GCTGTTGCAC 
GGCTGGATAC 
ACAGGACGCG 
CTCTGCTGGT 
GGCTGAAAGC 
CGCCCTCTGc 
AACTTTCCCG 
CTTTCCGTTG 
ACGCTCAAAA 
TTGCGAGGAA 
AAGGAGCAGC 
GAGCGTGAAA 
AAAGGACTTG 
GTACGGGCGG 
GCGCGCGTAG 
ATGC TGCAAT 
GGGGCCGTCG 
CGGTGCGCAG 



CGTATGGC AA 9360 

C AGTGTTGGG 9420 

CCGGCGGCAA 9480 

GTGCGCTATC 9540 

ACACAGAGCC 9600 

GGGGACATTG 9660 

GCATTCCCTT 9720 

GGTGCCGCCG 97 80 

GCGGGGgCAG 9840 

ACCAAGTACG 9900 

GAGAAGCAGG 9960 

GCGCAGGAGT 10020 

ACGGACCGCT 10080 

CTCAAGGTGG 10140 

CAGGAATTTT 10200 

GTTCCTGAAA 10260 

GCAACGCGCG 10320 

TGGCGCTTCG 10380 

CGTTTCCAAG 10440 

GCATTGGCGG 10500 

AAAATCAGCG 10560 

AGGAGAAAGT 10620 

CTGCACGCGC 10680 

ATCGGGGAGT 10740 

TTATCAATTT 10800 

ATTTTCTTTC 10860 

CCAACGGGGG 10920 

TGCGTGGTGT 10980 

GAACAGGCAG 11040 
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TGCCTGCGCC GGGGACGCCG GCTCCTCCCG CACACACGGC TTCAGAAGCG GTGCCTCCTG 
CGCCAGAGCC CCGTGCGGAA GGGGAGCAGC CGTCTCCTCT TGTCCCCACG cTCTGCCGGT 
CCCTGGAGGG GCAGTGGCTG CACGCGCAGC GCCsGGCACA GTCGGTCCGC GGCTGTGGGA 
GCAGCTGCTG CAGTGGCGCG TGCAGCACGG TGACGAACAC CAGGCGCCGC AAATGGCCTA 
CGAAATTGCC GCGAACAATT ACGACATTGC GTTGGTAAAG TCCATCGTGG ATCTGAGGAT 
GGGGACTGGA CACATACACC ACAACCTGAA TGGGAACGGG GCCGGGGGTA TGGCAAACGG 
TACGCCGACG CTTTCTCCCT ACGTGCATCT TTTTTTTCCG ACCTATCAGA ATTTGAGTTT 
AAAAGCGGAT ATTGCGATCA AGACCAACAC CCcTTCGGCA GACGTGACCG CGCTCTTTGG 
TATGGATCTG TACTCCAAGG TGCGGCGGCA GCATCAGCTG CAGGTGCGGC GTGCGCGCAA 
TAGCATGCTT GACGCGTTTG CGGCGCACtG CGGGGGCAGc ACgctGCGCG GGAAGCGTTC 
CTGGC TGAGC TCGATGAGCT GCTAAGCGCA TACAGCACGC TGCTTGAAGC ACAGGTAACC 
GAGCAGGAGT GCACGCGCCT AGTGCGCACG ATGCGCATAC AGCGCTACCA AGCGCATTCG 
GTAAAGTTGC GctCCgCAAC GCTCAAGCAC GCACGCGCAG AGAGAGTTGC CCGTCGTGCG 
CGCAAGACGT TCACCGCCCT GTATCAGGAT TTTGTGCGCA AGTGCGGGGC CTTTGAAGGA 
AATGATCCGG AAACATTCAT GCTCCATCTT GCGCAGGTAG TTCCGCAGGA GCCCGTATCT 
TCTAnCCGCA CTGCTTTCAG TGGAAAATGA CTGGGAGTTT CTTAAGAACA GGGAAGATTT 
GGAAACTCAG GCTGAAGCGC GTGCAGTGGA TGCTATCTCG TACGGGTTTA ATGTGGAGTC 
TGGGGTGGGG TCTGAGGGTA AGTCATTGAA gAGAATATTG GCAAATGTCA GAAtGGACTT 
TCCCGGCGGT GGCTTTTGGC TTGGATTGAA CTTACCGTAC CCGcAGTGGT CCCGTGTGGA 
GGTAAAATTT CGGC TCACGT GGGACCCGCT TTCCATTAAG TATcAGGAGC TTTCACGGCA 
GACACTGCAG CTTCATGAGC GGCTCAGTGC GCTTAAGCTT CAAGACGCGT ACGAAGCTTC 
TGAGCGTAAG GTGCTTGGCC TGCGCCACAC CGCCGAGTCG CTCGGCTGGG AACAAGAGGC 
GGCACTCACC GAACTGAATA TTCTCAGGCG GAGTGCGCAA ACGCACCAGA AGTGGCTGGA 
AAGAGGAGCT ATCGGCGCGC ATCAGCACGC CCGGGCCCAG CACGCGTACC TACAGGCGCT 
CATCACGTTG GCCAAGATCA ACATTAAAAT ACTAAAGTTT AAC CTTGAAA CTGCGTCTTC 
GTTCAGACCA GTACTCTAAA GAATACCCCA AGAAGGAAGT TGTATGACCA CAGCACAGAA 
ACTCCTACAC AGAAAATCGA CCATCGCCAT GGTGGTCGGA ATTCTCGCCT TCTTATTTGT 
TCTTCCCCGC TTGGTGCGGG CGCTGCGTCG GGTTCCGCCG CCTACCCTCA GTGTGAGTAA 
GGAGGTGGTG CTCAATAGGA TTGAGATTTC GGGGTACATC GAAGCGGCTC AGCACCAAAA 
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PCT^B 


tl3041 


GCTTGAGTCC 


CCTGGTGAGG 


GAATCGTGCG 


C AC CGTACGG 


GTGCAAGAGG GAGATACGGT 


12840 


GAAGAAGGGG 


CAACTCCTCT 


TTTCGCTTGA 


AAACTCTCAC 


CAGCAGCTTG 


ACCTTGCCGA 


12900 


GCATGAGTTT 


GCAATCGAAC 


AAGAAGAAAT 


TAACGGTGTT 


TCTAAAAAAA 


TGGAGATCAT 


12960 


GAAGCTAAAG 


AGAAATATGC 


TCCAAAAAAG 


ACTGAGGGAA 


CGCTACGTCA 


CTGCCCAGTT 


13020 


TGATGGCGTT 


GTTGCCGCTT 


TTAAGCTCTC 


TCCCGGACAG 


TACGCGAAAC 


CTCAAGATTA 


13080 


CTTTGGCACT 


CTCATCGATC 


GCTCTTACTT CAAGGCAAAT GTCGAGATTC 


CTGAGGTGGA 


13140 


CGCTTCGCGC 


CTCAAGGTAG 


GGCAGCGCGT 


TGAAATTTCT 


TTTCCCGCAG 


AACCAAGCGT 


13200 


GAAAGCGGTG 


GGGAGTGTCA 


CTTCCTATCC 


GTCCATCGCG 


CGCGTTACCA 


GTGTCGGGCG 


13260 


CACCGTGGTT 


GACGCCTCCA 


TCAGGATCGA 


TGAATTGCCA 


GAAATACTGC 


CGGGTTATTC 


13320 


CTTCAGCGGG 


GCAATTGTTG 


CCGGGGAGCA gGAGGAAATT 


TTAGTCCTGA 


AAGCCAAGAC 


13380 


GGnCTC CGGT 




GTGCTCCGTT 


CGTGGAnCGA 


GTGCTCCCCA 


GCGGTAAGAT 


13440 


AAAGTCTGTG 


GCCGGTTACG 


GTGGAGCCGT 


ATGTTnCCTG 


GCTTTGGTCA 


AAAATAAATT 


13500 


TCTGGGGCTG 


GGGGGCGG 










13518 



(2) INFORMATION FOR SEQ ID NO: 65: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4448 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 



AAAATGACAn 


AAGCACAACG 


GGnAGCGGTT 


GGAGGTTGCC 


GGTGACATGC 


AGCGCATGAT 


60 


GAATGCAGGG 


CGCGCGAAAC 


AACGCACGGC 


GCACAGGAAG 


CGCGTGAAAG 


TGTCACGCCC 


120 


TGCTACGGCC 


TTAAGGTGGT 


GGATGCCCAG 


CACTTGTTAT 


CGGAAATCGT 


GCTCGTTGAT 


180 


CCAAAGACTG 


AAGCGCCTTT 


GCGTTCTTCT 


TCCCTACGGA 


CCGTCCGCAA 


CCGGCTCCTG 


240 


TACAGCGAGC 


CTCACGCGCT 


CGTCGCCATT 


GCTGACACGA 


CAGGGAACGG 


CACCGTCCGC 


300 


CTCGTGCACA 


TAG AC CCAAA 


GACGCTGGAG 


GTAACCAAAG 


AGAGTACCCA GCGTATAGTG 


360 


CGCAAAGTTT 


TCTCTTGAGG 


GAAGAGGAGC 


AtACTATGCG 


GTGATCGACG 


AAAATGGCAG 


420 


CCACTTCCTG 


GGACGCTTTA 


CCAAAAATCT 


TGAGCTGACT 


ACTCGTTCTG 


CAGCGCnGnT 


480 


GACGCCTTAT 


ACCGCCGTCA 


CCGTCACTCC 


GCGCGGAATT 


ATGGTGCAAA 


CAAAAGAGAA 


540 


AGGTATGGCC 


CTATTGCACA 


CACGGACGCT 


CGCCGACGCG 


CTACCCAGAA 


CATGAGCAGA 


600 
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ACGGTGGTGG TGCTCGCCAC 
AGTCTGCACA GTAATGCTGA 
ATCAAAGGGG AATTCGAAGC 
AGAAgTGGCG CGCGCATGCA 
CCGCGCATCG GCTACATCGG 
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TACGCGATTA TCCTCGACGG 
GTCTGGTTCA TTGCGCAGTC 
GCTATCAGCT CAGGACAGTT 
CTCCTCACCG AAACATTCGT 
GAAGCGTTTG GCAAATTTGT 
GCACTGGGAG GGGAACGGAA 
GAGATGTCGG AG AAGCTTC C 
GTATGGTAGA CTGCATCGAG 
TTATGGCGAT ATGGGGAGCG 
GCATGAAAGC GGTCTTCCTC 
CATGCTCAAA GCCTCGCATT 
GTCAAATCGG CTCTTCCAAA 
CAAGCAGGAT CGAAGCACTG 
CATATGAGCT TGTTAAAGAT 
GAAAACGAGA AC CACTGAAT 
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TGTGTGGCCG CTTACGACGA AACTCATCGG TATTATCAGC 660 

CATTGCGGTT ACGGCCATGG CTTCCTGGTT CTTCGCTTcG 720 

GCTTAATAAT CTTGCCGCTG CGGAGAACTT TGCTGCGCAA 780 

CATTGCCACA AGCGCCAAGT CCTTCGTTTC CCTTGGCCTC 84 0 

CTCGCGGTCC GCACTTTCTA AAGATTTTTT TTCCTTTTAC 900 

AGTCGGCGGT GTAGCCGAGC TGTGGAACGG TGACTTCTTC 960 

GGCAGACGCT CGTCGCTTCC TAGCTGACAA CGCACAGGTT 1020 

CCCAGCCACG CTCAACGCCG CCCCCTGGTT TAAAGCGCAG 1080 

CTTTGAAGTT GACGGcGCTA CGCGTAACGT TGTGGTTATC 1140 

GCACCTGCTA GAATCTGGAG CCTCCTCCGG AACCATGTAT 12 00 

CTCCCTGTAC CACCCGGAAT ACTCTCTCAA T t AC AGCAAC 1260 

CGTTGTGCGC GATTTACGCG AATCTACACA GCTGACCAAA 1320 

GGACAACAaG CGCTACTTCG GCGCGTTCGC CAaGCAAACC 13 80 

CCTAGAAACG CCTATGAGTG TGGTGTACCA GGCAGTATAT 1440 

TATCCTCACC GGCATGGTGC TCCTCGCCTC TATCTTGCTT 1500 

TATCACCCGC CCTATCCTTA CCCTCGTCGG CGCAACGCAC 1560 

CCTCCTGGAT ATCAAGCCTT CAAGCAAAGA CGAAATTGGC 162 0 

GAGTATGGGG CGTGGTCTGG CAGAACGGGA ACGCATGAAA 1680 

AAATAGAGAC ATCGCAGAGA AGGCCATGAA GGGAGAGCTC 1740 

AACCGCTACC ATTTTTTTCT CAGACGTGCG CTCCTTTACT 1800 

CCCTGAGGAC GTAtAGAGTT TCTCAACGAG TACATGAGCT 1860 

CAGACAGGCG GCGTGGTGGA CAAGTTTATT GGAGATGCGA 1920 

CCAGTTTCCC TCGGCTCTGC ACGCTTAGAC GCATTGCAGA 1980 

ATGCGCGAAA GCCTTATTCA ACTGAACGAA AAGCGCGTCG 2040 

GGCATCGGAT GCGGCGTAAA CACAGGCTCC TGCGTCGCAG 2100 

CGTATGGAAT ACACCGTCAT CGGAGACGCG GTGAACACCG 2160 

AATAACC cGT TCGGCACTGA CTTTCTTATC TCCGAAAACA 2220 

ATGCTTATAG TGGAGAAAAT GCCCCCCATA ACGGTAAAAG 2280 

GTGTACGCTG CTATCAATCT AAAGGGGCAT GACGGACCGC 2340 
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PCT^I 


Il3041 


AGACGCTCGA TGAGCTGCGT GCACTTCTTT 


CCATTGAAAA 


GCCGGGGCTT 


TCTGCCGACC 


2400 


CTGACTTCGA 


AGAAAAGAAG 


TGTGAAGTTA 


TCTAAGCAGG 


ATGCCACGGT 


TACGGTCGTT 


2460 


ATTCTCCTCC 


TTATCCTGCT 


TCTCGGCTGG 


GGCTACTCCC 


GCGCGCTCCG 


TCTGTCCCAG 


2520 


GGGAAGGGAA ATCCAATCGG ACGGGTTTTT 


TTTTATAAAA 


AAACCGCAAC 


CCGCAAAAAA 


2580 


AACAACCAAG 


CCTTATGGCT 


CAAACTCAAA GACGGGGTGC 


CCGTCTACCA 


TCGGnGyAss 


2640 


TGCGCACCAC 


CACCGGTTCT 


GAAGCTGTCA 


TTGTGTTCAC 


TGATAACAGC 


AGGCTCGACA 


2700 


TTGCAGAAAA 


TACCATGGTG 


CGCATCAGTc 


ACACAGGAAT 


GAAAAAGAAG 


GATGTACGTT 


2760 


TGGTCACAGG 


AGCGATTACG 


t ACGC a CGCG 


CCGCTGGGAA 


TCCAGCAGCG 


CATACCGTAC 


2820 


ATGTAGGAAA 


GACAACCATC 


TCGCTTTCTG 


GAGACGGTCA 


GGTGAATGTG 


CGCGGAGGCG 


2880 


AACGCGATTC 


AAc TGTCGAG 


ATAGCACGCG 


GTGAGGCACT 


CCTTC AC GAT 


GCGCAGGGAC 


2940 


AGACAcTTCC 


CCTTCAGACG 


TTCACCCAAC 


TTGCTACTTC 


CCGGGAGGAT 


GGCACTGTGC 


3000 


GCATTCTGCA 


CCCCACCTTT 


GTCCCTCTCC 


TACCCGACCA 


AGATGCACTT 


CTCCTGACTG 


3060 


CCGAGCACAC 


CAGATCTGTG 


GGCTTTGTCT 


GGCTCGGCGA 


TGCCACGACG 


GTACAGCCGA 


3120 


GCGTCCGTCT 


CCAAATTAGC 


CGATACGCGG 


ACTTCTCGGT 


TATTGAAACG 


GAAAGAAAAC 


3180 


TTACCCTTCC 


GCATGAGGCA 


AACGCCTCGA 


GG AC AAC ATT 


CAAAACCAGC 


GAACGACTCG 


3240 


GGGAAGGACG 


CTGGTTTTGG 


CGCCTGGTCC 


CGCAGAACGG 


CACGcGTCAg 


CGCCCCGTTC 


3300 


CTTTTCTGTG 


CGTCGCGCGC 


GtAAGGTGAT 


GCTGcACACG 


CCGCGTGCTC 


AGGCAGTACT 


3360 


CTCCTATCGG 


GATGCGATTC 


CTCCTACCCT 


TTTTTCCTGG 


ACGTCTGTAG 


AAGACGTGGA 


3420 


ACAGTACCGG 


CTACTGCTTT 


CTTCCCGGGC 


CGACTTTAGC 


GCGGATGTGA 


AGACATTCTC 


3480 


TTTGCGTACG 


CCGGAGATCT 


CGGTACCCGG 


GCTCGGCGAG 


GGAACGTATT 


TCTGGAAGgT 


3540 


AGTACCTCGC 


TTTGATGAGG 


GAATAGAAGA 


CCCAGTCTTT 


GCTTCTGAGG 


TAGGAACCTT 


3600 


CTCCATCAAA 


CAGGGAAAGG 


AGCTGCATGC 


GCCCGTTGCG 


CTCTTTCCCG 


CCGAGGACGA 


3660 


GGTGCTCGAA 


CACGCGGATC 


GGGAAAATCG 


CATGGTAATC 


TTTACCTGCG 


AGCCAATACC 


3720 


AGAAGCACGG 


CGCTATGTCT 


GGACGGTTAA 


AAACATGGAT 


GCAAACGCGT 


CCCCGCTTGT 


3780 


GACTACCACG 


TCGGTACCCT 


TTCTTAC CGT 


TCCCATGCGG 


AGCCTGCGTG 


CACGATTGCA 


3840 


GGAAGGAACA 


TATCAGTGGC 


AGGTAGCGTG 


GGAAACGCGT 


CGGAGCGATC 


GCTCCCCCTA 


3900 


CTCGGCACTG CGCGCGTTCA CGGTCATTGA AGGAATGCAC 


GCGTGGGAAG 


AGGAGCCAGA 


3960 


GACGCGTGAC 


TTGATTkCGC 


TCcGCTcCTT 


CCTTTGgyTG 


CGCGACATGC 


CAGCACTCAT 


4020 


TACTGAAAAA 


TACCTTTTGC 


AGCATCGcGC 


GTTGCGTTGT 


AAGTGGACGG 


CGGTGCACAA 


4080 
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CGCACAGCGG TATACGGTGA CGTTAAAAAA CAAGAAGACA GATGCGGTAC TGCAAACGGC 4140 

AACTACCACA GGGGTGGAGT TCTCATTTAC CAACTTAGCG CACCTTGAGG AAGGGTCATT 4200 

TCATTGGGTC ATACAGGCAC ACACAGAGCA GGAAGGCTAT GAGCCTGCAA GTGCACAGGT 42 60 

GGTGCGCGCG TTCACCATAC GGGTGTCTGA ACTTGAAAGG CCGCGCGCAA AAGAAATTGT 4320 

CCATTATGAG TATCATTAGC CGCGTGTGTA TACCGTGTGC GGTGCTGCTG TTTGCGCAAC 4380 

TGCACGCGAA GGAACTCGTC CACGTATCTC AGTTAAAAGA ACAGGAAGCG CGTATCAGCT 4440 

GGCAGGAA 4448 
(2) INFORMATION FOR SEQ ID NO: 66: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3219 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 

CGCGCAGCGC GGTTTTCAGA TTTTGCGCCT CTTCTGCACT GAATCCACTG ATACTCCCTG 60 

AGCCCGCTGT TATCGGCTCC CGGATAGCmG GGGCAGACCT aATTTTACCG TCAGAGACAA 120 

TTGCAAGGCG ACGCCCTATC TCCTTAGTGG TAAGTTCCGA AAAAATACGC GCTCCTTCAT 180 

GGTCCAGGTC AAACAGCACC AACGGCTCGT TCGCGCGACC TGAGCTCACC GTTGCATCAC 240 

GAATATGTCT TCCTTCAAGC GCAGGCTCCT TCTTAACAAC CAGAAACCCG TCGCGCACAT 300 

CAAGTCCGTA GgAATCCTTG CGATATACTC CGAGCACACT GGTGTGCTCA GGAACCAGAG 3 60 

ACAGATCGTG CAACTGATGC GCAgskTCGA AGGTaCccTG CGGGTTATTG CGATAGTGAT 420 

CGAGAAGCTT TTGAGTCGCA TCATCATCCA CGAGATGAAA CGCCAGGACA CCACGACCCA 480 

TGACGATAGA ATGAACACGG TCACGGTCAG TAAGACCAGG AATCTCCACA TACACGCGAT 540 

CTTCCCCTTG cCTCCGAATA ACGGGCTCAG AAAGACCAAA GCGATTAATA CGaTTCTCAA 600 

GGGTACTAAG CACCAGCGCC ATCGCTTCGC TGCGTATTGC GGCGCGCTCT GCATCCGGAA 660 

CTCCCTTGGT AACTTCGCTC AAATCAGCTT TAATCACCAC GCTAGnGCCG CCGGAAAGAT 720 

CAAGCCCGAG CTTGACAGCT TGGGCCTGTC TCCTTTTCAT CTTCAGGACG GCTTCCCGGT 7 80 

ACGTCTGcTC CATCAACGGT CGCGCGTAAA GAACAAAACC CTGCTCGCTT TTTACGGGGA 840 

ATGCAGAAAC GAGTGCCGCA GCGGTCCAGC GAGAAGGGGC CGGTCTGCCC GAATAAGAAA 900 

GATTCTGGCG CGCAGGcAaC AAGCGGTGCA TAACGCGCTG AGATATCCTC ATCCGACCCC 960 
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GCACGCGCAA GACGGGTTAA ATCCGCAAGA TCACGCTCAG CACTCTGCAC AGCGTACTCT 1020 

TTTATCTGCT CGCGCGAGCT GAGCGCACGC TGCCGCGTTT GTGCGTCGGT CAGAAAATAC 1080 

CACTGGAGTG TAGGGAACAA AAAC CCAGAG CACGCAGCAA GAACAACAAG CACGACCCCA 1140 

AACCGAGCCT TCTTACTCAC CTGGCGATCT CCTTGTCCAC ACCTGTCAGG GGCACGCCGG 1200 

GCTTCGAATC GCAATCTGTC TTAGGATTCG AAACACCTCT CCTGTCGTTT ATGCGCGCAA 1260 

TCGCACTGCG GCTGACTTCG AGCGTGCCAT GCTCATTCAC CTTTATGACA AGGCTGTGCT 1320 

CCCGCACCAC GCTTACCACC CCGTGGATAC CGCCGATAGT AACGACAGGA TCACCCTTTT 13 80 

TTATGTTCTT AATAAGAGCC TGCGTCCTTT TCTGTTCCCG CAGATTAGGC GCAAAAACAA 1440 

AAAGGTAAAA GATCAGACAT ACGACGCCGA TAGCGAGCGG TGGGATCCAG CCACCGTTCG 1500 

CCGTAGTGAT TTGCAAAAGA GTTCGATGGG GCATTGTTTT CATCCTTGAG CGCGCAGGAC 1560 

ACACGAGCGC GCCCCCAGGC TAGCGCAAAA AAGACAATCC AGTCAATCAC ATCTCTTCTT 1620 

TACCAaCGCG CGyGyGCGCT gGCATTAATC TCAAAACGAA TCCATATCGG GCAACTCTAA 1680 

AATCAGCTGG ACTTCACCCT TAGAAATCTT CAGTGCGTGC GCAATAGCGT CGTCAGACCA 1740 

GCCAC TTTTG TGCAGCTTCA CCACATTCTG ACGTGTAGCC AGGGGCGGCG CTCCCGCACC 1800 

GGGTATTTTG TTTGCTGGAT CCTGACGCAT CAAATCACCC AGCAAGCGCA ACTGCCCCTC 1860 

AGACACCTTA GAAATTTCTT GCAGACGAGT TTCAGTACCC GCAAGCCACT CACGCGCATG 1920 

CTGTATCTTT TCAATACGAC TTTCCATTTC TCCCAGCAGC GCATCTGCGC ACTCTATGCG 1980 

CGAGCGCACA CGCTCTGCCT TTTCCTGGTT ATCCAACAAT ACGGCAATTT CTGCACGCAC 2040 

ACGCTGCAAT TGAGGATCTA CTGTCTCCAA TTCTCCCCTA AAATTTTTAA GCGTCTTTTC 2100 

AAGTTCC TTC AAGTTTTCAA ACGCTCTATC CACGTCCTGC ACCGTCTGAT CCAACACCGC 2160 

ACCTTTCTTA TCCAAACGCT CATAACGCGT ACTGATATCG CCAAGCCCTT CTTTGACCTT 2220 

TCGAATTTGC ACCTGATAGC GCTGCAGATC GTCATTCGC T AACGTGAGCT CAACAATCTT 2280 

CTTGTCCATA GCATCAGAAA GAGCAGCAAG CTTTGAAAAC TCTCCCTCCA AAAGATCAAT 2340 

ATTCTTTCGC TCCTGCATAA ACTTCTCAAC ACGCTGctCC GCCTCCTCTC CAAAGTGTTT 2400 

CACTTTTTCA TAC TGG AG AC TAAGCTTATC CATCGCCTCT CGATACACTT CGAAACGGGT 24 60 

CACCGTCTCA GTCAGCCGCT CGATATCCTT TTCAAGATTC TCCCGCAACT CGTCCGCCCG 2520 

ATCAAAAATA CGAGTCTGGC CGATAAACTC ATGCTGTTTA CGTTCAATCT CCTGCAGCAC 2580 

TTGCGAAAAG CGGTCACTTT CTCCCTGTAA TTTAGTGAGA AGACCTACCT GCGCCTCCCC 2640 

AAACTCCGCG CGCAAATCCT GCACAAGGTC TCGTGTCTCC TGCAAGGTGC GGTCCACCTG 2700 
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AGCACGTGCT TCTTTTACTG TCTTGTGCAG TGCCTCACTT TCGCGCCGAC CATCTGCATG 
GAGCTGTCCT GTCACAGTAT TTACATAGCC ACCCAGTTCC TCTTTGACCG TCCGCATCGT 
GTCACACATC TTGCC TATTT CATCACGCAG ACTTTGCAAA GACTCACCAT TCTTCTGAGA 
AAAATCTTCA T AC TG CAT AT CATAGCGCGC ACTCAGGTTT TCAATCGCCC GCTCAGAAAG 
ATTCACCAAA TGCGCAATTT TTCCTTCAAA CAACTGCTTT GCATCCgCAA ACTGCTTGTC 
GGTGTGTGCC TTCCATGCCT CGATATCCCG TTTGACCGAC CCACAGCCCC CCTGCGCCTC 
CTGCTTTATC TCCTGCACGA GCACATTCAT ATC CCGa ACT TCGGTTTCAA TCATACTTGA 
GTGAACATGC AAAGAGTCAC GCAGgCGTTG CCgCACTGCT TCCAATTCTC TCTCAATGAG 
ATTATGCGCC TTATGTGCAA GGCCGCGACA TCTAAGGTA 
(2) INFORMATION FOR SEQ ID NO: 67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2725 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67: 
CAGGATnCCC CATTCCTGAG AAGAAGGCgC GCATCrCGmA GtCgACTGAC TACCCTTCCG 
GCAGCCTCCG GTGCATCGTG CCTCACCTTT TTTACCCGTG GACACATACC CCAATTGCGC 
AtTTCAAAAA GTCCGTTGAA CAATCGTTCG TCGTTTTCTT ACACGCAGAT GTGCAACAAC 
TACGAACGCA AAACATCACG TGGCTTGGAT CCATTCGGCG GACCGACCAC CCCCCTTGCT 
TCCATTTCTT CGATTAGGCG CGCGGCGcgA TTGTAGCCTA TCTTCAATTT ACGTTGCACA 
TACGATGTGG ACGCTTTACC CGCGTATTGC ACTACC TGCA CTGCCTGCTC GTATAAAGGA 
TCGCTTTCAT CCACAAAATT TCCAGATATA CTCGCGTCGT CATCGTCAAA GAAAATTTCT 
TCATCAAGAT ACTCAGGCGT TCCCCACGCG CGTACATGGG CGATCACGCG CGCTAATTCT 
CGCTCGGAAA CATACGCACC TTGAATCCGC GTAGGAAAAG ACTGACTCGG GTTCATGTAC 
AGCATATCCC CTCGTCCCAG CAATTTTTCT GCGCCCATCT CATCCAAAAT AATACGGCTA 
TCCATTTTAG ATGAAACCAT AAAGGCAATT CTGCTTGGAA TATTTGCCTT AATAAGGCCG 
GTGATGACAT CGATTGACGG TCGCTGGGTG GCAAGTACCA AATGGATGCC TACTGCACGG 
CTCATCGCGC ACAAACGCGC AACACTCGTT TCTAATTCTT TGCCAGAGGC AACCATTAAG 
TCTGCAAATT CATCAATGAT AATAACGATG AATGGGAGAG GCTGCGTGGC GATGCTTTTT 



2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3219 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
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TCCTGTATTT TTTTGTTGTA GGTCTTAATG 
TAGCGTcGCT CCATTTCGCA CAGGATGTAC 
GTGATGACAG GAGTGAGAAG GTGGGCGATA 
GGATCAATGA GCAGAAGTTT GGTTTCGTCA 
AGCGCGTTTA CGCATACTGA TTTACCCGAC 
GTTTGGGCAA GGTCGATAAC CTGTGGTTCG 
ATGGCCATAC GGTTGCTGCC AGC TGTGCGC 
GATCGTTTTT TGTTAGGGAC TtTCCiuCCCT 
ATGCGCACGC TTGAAGCAGC AAGCTTGAGC 
GACAGTTTGA TGCCGGGTGG AGGGAGAAGC 
ATACCGGTGA TTTCTACTCG AATGTTGAAT 
AGATTCTTGG TGAGCTCGTC AATTCCTTCA 
TCGTACGGTA CTTGGTAGCC GCGGCAAGGG 
GGACGCGgTG GTCCCTGTTC ATCGTCTTGC 
GAGGGAGCAG AGATAGGAGA GAGGGCCATG 
GGCAAAGACC CGGGTGACGC GGGGGCGTGA 
CCGGGCGCTG GGAGCAGGGG GAATGGAGCC 
GGCGTGGACA CACCGCCACA CGCTGCCACt 
TCAAAAATTC TCCCCCCTGG AGGGGAACTT 
TTGCTTCGGG CGTCTGCACA TCTGCGGTGG 
TGTCAGGATG ATCGGCGGTG GAGGGAGGGA 
AATCcGAGGG ATACGTGCAT GAAACCATAC 
ATAGAGCTCT GCTCCCAGCA ATGCGAGGAG 
CCGCGTTGAC GGTGAAATTG ACCGTGCcAG 
GTTCTCCACA CACTGCAGTA ATGAACAAGA 
GTAACGAACG TCCGCCGACA AACAAGAGGA 
GCAAGGAGGA GAAAGCGTAC GTTTCGTAAA 
ATGCTCGGTG CAAGGTAAAA AGGGGAAGAA 
CGAATAGCAG TGTGCCGAAG gTAAGAGCGA 



557 

TCGCGGCATT CTAATTGCTC 
TGTAGTGCTT GGAGTGCTCT 
TCGTTGTAGA GCTTTAACTC 
GGACACTTGT GGTACAGGAT 
CCAGTTGCGC CTGCAATGAG 
CCGGTAACGT CTTTGCCAAG 
GTATGGAGCA GTTCTTTGAA 
ATGGCGTGTT TTCCAGGAAT 
GCAACGTTGT CCTGCAGATT 
TCGAACATTG TGACTACAGG 
TCAGAGAATG TTTCCTCAAG 
TATGTGTCCT CTGAGTACTG 
TgCCGAAgCG GAGCTGCTGA 
GCAGgAATAA GGGTTTCAGC 
AC AC AC GG TG CCTGTGCAGG 
ACGTCTGAGG GAAGGTTACT 
TGAGAAGGCA CCGATGGCGC 
GCGTGGCAGG CTGCACCTCT 
CCGTGGAGAA TTGCCCCTCT 
CGCAGGAGGG AGCGGGGGGA 
AGGAGGGGTC TTGGAATCCA 
GTAACACcTT TCCCCGTAAA 
GCAAAGGACG CATACGATGT 
CGAGTGCGCG TCGTAGCGCG 
GTGGGAAGGC AACAAGTGCG 
GCGCTGTGTG CAAGAGTAAC 
GGAGAGTGCC AGGTACGAAG 
ACGTGGACAG GGTCaGGAGC 
TAATTCTAGG TAAAGGGGAT 



AAGAAGCGCA 900 

TTTGGGCTCA 960 

TACGATTTTT 1020 

AGAGAGAATG 1080 

CAGGTGAGGT 1140 

GATGACAGGG 1200 

TGTAACGAGG 12 60 

GGGAGCGACG 1320 

TGTAATTTTT 1380 

ACCCTTCTTG 1440 

CAGGAGTGCA 1500 

GTCAAGCAAG 1560 

GGCAGGAATA 1620 

GGGGGCGACT 1680 

GATGACAGAC 1740 

CTGAATGAGC 1800 

C AAAGC AGTT 1860 

GCCTCAGAAA 1920 

GGGGGCGCGC 1980 

GGGGAAACGG 2040 

TCAGCGATGA 2100 

TGAGTGCAGC 2160 

CTATCCCTCC 2220 

TAGAGACCGT 2280 

CTTTCTGCCC 2340 

AGCGGCACGA 2400 

AACCAGTGTG 24 60 

ACTGCACTGA 2520 

CGTTCCATGC 2580 
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ATTGTCCTGA AC AG TTAATC TGTTAGCTTG CACGCCCTGC AGGCTACCGA CCCCGACAGA 2 640 

AGGAGCCGAG TGAGGGGAGg AAACAGGCGC GACCCAAtAT CTTTGTAACG GTAAGATGCT 2700 

TTGCGTTACA CTGnGACGGG CGTnG 2725 
(2) INFORMATION FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3406 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: 

CGGCGCATAC TGTAC CGC AT CCTCCTGGTA TGCGCTnTCA CCCACGGnTT CCACAGCCGG 60 

GCAAAGTTCG TCAGAAACGT GGGGTTCTTC CCAATCGTCA GGTAGGCCCC ATAACAGTGT 120 

AGTGTCGCCT CTACnTTCCC CTTGCGCTTA ACGGCAAAAn CTGCnTTCCC CTGACTCAGG 180 

TCCGCCTGCA GGTCCGCCAC CTTnCAGTCC GCATACAGTG CCGGGTGCTG CCCACGGCGC 240 

GTGTGGGTGG TGCGCATAAn CAGGGGAAAG GATACTCCCA CCGTGTTGGT AG TACGAAAC 300 

CCGTGCTTCA GATTGTAGGG ACCGGTGCCC ATAACTGCAC CAGGGGCCTG GCCATGACTG 360 

CCTACCCCCT TGCCATAGCT GATGCCCCAC TCAAGTGTGG CAGAGCCAGT TAGCTTCGGG 420 

GAAAACTCCT GTCCGAGCAC TCCCCCGCTC GCTCCTACCC CCACCACCAC ACACAGCACA 480 

CTCCCCCACC GCATGCACCC CATGCTACnT CACCCCCCCC CnGGnCCTGT CTAGTAGCCC 540 

CyTCAcCTTC TTTTCTAACA CTACTGCCCA ATCAAGGTAT CCAGCTGCTT AGACAGCGCA 600 

CGGTgTGCGC ATTGTTCGGA TCCAGTGCAa TCACCTGATG CAAATAGTAC TGCGCTTTGC 660 

GGAAATCCTT TTTCTTTCGG TACCATTCAT ATAACGCAAA AAGAGTGCGT CCATTGCGCG 720 

GATCTGAAAG CAAACTTGCA CGCAATAAAC TTAATC GCTC CTCCTCGTGC ACTGACAACA 780 

ACGCTTCATA GTACGAAAAT ATCGACCGCA GCGTACCACG CGCGGACGCA CTCCGAGCTG 840 

CAATCACCGC ACGGATCTCC CGGTAATGAC GCGCCTCATA CAACGTATCT AAGTACAAGA 900 

CGATGACCGT CTCAGACGGA GGCTGTGCCG AATGATATAA ACGCCGCGCA AG AG AAATC G 960 

CCTCcTGCGC ACGACCTGAA CCACTGTACG CGCGAATAAG TAATTCTTGA TGAGCCTCAC 1020 

TCGGATATGC GGTATTCAAA CgCTCCGCGC GCGACACTGC CTGTTCCCAG TTACCCTGCG 1080 

CCAGTTCATA CTGCGTCAAT AAACGGAGCG CTTGAGCGTT ATGCGCATCG GCGCGGAgCA 1140 

CCAGCGCGAT AAAGCTTcGC ATGTTTTTTT GTGCAAGCGA GTGACCGGTC TCAAAGCAGT 1200 
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GCTGCGCACA CGCAAGGAGC 

AAAAGTCGTG CGCAGACGCG 
GCAAGTACGT TTTATCCGAA 
TCTGaTACTC CCGcTGaGCC 
CAGGCTGTTG ACGCAACAGC 
CGCTGTGAAT TTGCGCCTGC 
AT AGGCGCGC AATGGGCAGT 
AGCAGATGCC TGCAGGATAG 
CCGCTTCAGA GAGCTCTCCG 
GAGAGTGCGG GGCCTGCGTG 
TCATCAGGAA GGCGTTTTTT 
GGTTCTCCAG GAATGAGCGG 
GGATGCGCGG TGTTGTACAG 
TGCATAAGCG CAnACACACG 
CGCGCTGTAT GCACGCAGCT 
AGAGTATCTT TGGATCCATC 
CAtGCCaCGC ACAGGaTGCG 
GGTACtGCGC GCGTCGGCAC 
AGCACAAAAC CGACACTCAC 
GAAACCCGTT TTTACCTCCc 
ACCGTACCGA CAACGTACGC 
TCAACGcgCA CGGCGCATCC 
CGAGGGCGAA GAGAGCGTAC 
GTGCGTGTTC ACTCTTCCTG 
CGCGCCGGTT GTCCTTCCTT 
GGGCGTACGC AGCGCACcAg 
CGCGCTCGTG CTCGGTCCGC 
ACAGTACCGG AGCGCGCGTT 
CTACCGCCCC TTTATCGAAA 



ACCTGC AC AT 
TCATTTTTGC 
CTGTCACGCT 
ACCAAGATAC 
GCTGCAACGT 
AAAAGCTGCA 
GCCAGGGCAG 
CAGGATGCGT 
TGCTGCTCGT 
CGTGCGCGCG 
C AAC AG AG TC 
ATACGTACCT 
CGGCACATGC 
CGGAGGTACA 
GCCTCGCGCA 
AGCTTTGTGC 
CCGCCACCTC 
CACCTTCGAA 
CCCGATGGCG 
TGcATGGCAG 
ATACCGGCGC 
CTTGTGCTCC 
AGGAGGTTAA 
TTTCTCCTTC 
CTGTTTCTCT 
CGTTGCGCGC 
TCGTGCACCC 
ACCGGGAATA 
AACTGTGCGC 



559 

CACGCGGATA 



PCT^Pl/ 



TCCATTCCTT 
CTGCAAAAGA 
GAATGTGTAG 
ACGGTTGCGC 
CTGCTCGGTC 
TCTCACCGCG 
CTTTTTCCCA 
GCAAATACCC 
TtCAGGCGTT 
AGCGGGGGAA 
TTTTCCACGT 
CAAGAAACAG 
GCCGATTGTG 
ATGAGGCAGG 
GAATTGCGCG 
CGGCGCCTGG 
GAACTGCGAC 
CGGTCAATGC 
TCGTGCC ATT 
ACCGCCGCTT 
CCACGCAAGA 
CGGTTTTTTT 
GTTGTTTCCC 
GTGGTGCGGC 
TCAGTCGGTA 
CCTGTACCCG 
CCTCTCTGTC 
GACGCTCACC 



f 13041 

CAAGCGATAT GCCTGCTGCA 1260 

TGCC ACCTGC GCACGCAAGA 1320 

GTCAAGGGAC GCGTGaGCCT 1380 

CAAAAGCGCC GCATTGTCCT 1440 

GGCGCTCCAT TCTTGACGCG 1500 

TTTCGGGAAA CGGTGAAGCA 1560 

CTGTACCAGC ATGCGCGCAT 1620 

TGCGCTGCGG TACCTGnACT 1680 

AAAGAGCAGG AACGGAAGGA 1740 

CTCGGCAATC TGCGTGTACC 1800 

TGATATCCGA AAAAAAATCC 1860 

CATCGAGCGC AGTCAGGTAC 1920 

CTTCCTGCGG ATATACAAGT 1980 

CGGTGTCAAT CCGGCCGGAT 2040 

AGAAGCAGTT TCAATCAAGA 2100 

TCGGTC CGGT ACGTCAAGCG 2160 

GATGCGGAAG AAACCGGATC 2220 

AACAGAGAAA AACGGGTATC 22 80 

TTTTAGTAAG CGCCCCCATA 2340 

TACACAAAAC GCCTGTTGTG 2400 

CTTTTCCCTT TACGCGGTGT 24 60 

CATGCTAGGC TGGCTGCCAC 2520 

GCGAGAAATC ATTACCGCCC 2 580 

TGCTGGTCCC TGTGCGCGGG 264 0 

AGCCTGCCCT CCGCTTTGGG 2700 

CCTGACACCC TCATTCAGCG 27 60 

CCGATGCAGT CCTTCAAAGA 2820 

GTTATGcAGC GGAGCGCGCC 2880 

TTCCTGTCG A .GCTGCTCTTT 2940 
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CTCCCCGTTG TCGAATCGGG CTTTCTCGAA CGGGCTGTCT CCAAATCCGG CGCAGTCGGC 3000 

ATTTGGCAGT TCATGCGCAA TAGCATCGCA GGATCTGCCA TGCGCGTGAG TGACTGGGTA 3060 

GACGAACGGC GTGACCCCTG GAAGGCTTCC GTCGCCGCAG TCAAAAAACT GCAGTGGAAT 3120 

TACACGCAGC TGCGTGACTG GCCCTTGGCC CTCGCTGCGT ACAACTGCGG TCTTGGCGCG 3180 

ATCAAGCGAG CCATTGCCCA GGCAGGAACC GCCGATTTTT GGCATCTGAG TGAGCGCGGc 3240 

TTTCTGCGCG ACGAGACAGT CCGCTATGTC CCAAAGTTCC TTGCGGTTGC AGAAGTACTC 3300 

AGCCGGAGCC ACGAGCACGG CATCGCCTGG GGAGCGGCAC ACACCCCCGA GGAGACCACC 3360 

ACGGTTACCG TTTCGCGCGC GGTAGACTTA AACCTCTTGG CACAGG 3406 
(2) INFORMATION FOR SEQ ID NO: 69: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7874 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 

TGATAGCAAA TTATC TGCTg AAAGGCTCAC AGTACAGCAT GTTGGTCCCA GTGGTTGTTC 60 

GCCTGGGGGC GtATACAAGG TGTGAGGGAG TTGGCATTCG GGGGGCGTGT GCGGAAATGA 120 

AATGGaGTGG GCTCTGTCTC TTTCTCCGGA CGGGGGGGGG GGGGTGCGAA CAGATGAACG 180 

GAAAGCGTGT GTTTCTGGGC ATTGTCgTGG TGGTCtGTGC CGCGCGCTGT TTTTGCGCcG 240 

GACGTGTTCT TCTCCTCGCa TCTGGGGTaT GGGCGTTTCT ACGCC cGTGG GGAAAGACAT 300 

TGAaGGGGCa GCACATGCAC GTTCCAAGCA TTGGCGGCGG CGTATGTGTG GTGGCArACA 360 

GCGGGTTTGC CTTCGCCTGC ACGGTGGACG CAGCCCTGAC CCGTATAATG CTGAAAACTC 420 

AGGCGCTCTT TGGCTATGCC TTTCGGTGGG GAGCGTTCAG CCTCATCCCC TTGCTTGGGA 480 

TGGATGTGAT TGTGTCGAGC GACCACGCGT TTGGTGTTGC CGCGCAAGTG TCGTTCCAGC 540 

ATTTGATTTC TGAGTGGTGG GGCTTTGCCT TGAGTGTGAG CGGCGGGGTG GACTTTCCGC 600 

TCAACCCTAA CACCCGCTTT TTAGCAGGTA AGCTGCCTGC AGAAACGGTG CAGCGCGTGG 660 

CsTCGTTGCG CTGCGGCAAA AGCTTATTAG CGAAnGGATT ATCAAGGCAT TGGATTTGGG 720 

CTGGTTTATT ACCTTCGCTC TGACCGTTGT TGCCGAGGGA TTCAGTTGGA TTGTGTCGCA 780 

GAGCGCTTGG ATTGCGCAGA AGGCGGTGAA TTACTTTTTG AGCGACACCA CGCGTTGTCT 840 

CATTCTCCCG GTCACGCTGC GGGCCGGTCC TACCTTTCGA ATATAGCGTG CGGGGGGGGG 900 
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CGGATTTAGC GGCGCGTTGG CGTCCGCTGT CGAGTTGCCC AGAGCGCGAG AAGAATGCGG 
TCGATTTCTT GTGTAGCGTT CCGTCGTGCG CCGCGCACTT GAAcGAGCTC AGCCGGTGCA 
TGAGCGAAAs CgCGTGGACG ACTGTCTGCC CGAACTTTCC CGGCAGtGCG GCGGCACcGC 
AGTATCCACT TTGAAAGGTA TACCATCCCC GAGGCCATGC AGGCCTGGCG CGCTCCTGCC 
GAAAGCGCAC CGCTCGTCAG GGGGACCCCA AGCAmCGTTG CCCACACGCG TCCAACCGGC 
GGTCTAAGGT AGCAGGCGAG CGTTCCCACC CACAGCCCTG CCAAGGTCAT CCAGCGGATG 
CGTCCCCCCk TTGCCGCATA GAGCAGCAtG CGATCCCGCA CACGGACGCC TGCACGCTCA 
GCCGCGGCAC ACACAGCAGT ATCAACAGGA GCAGGAGTCC TGTCCCCATC CGTACATAAC 
GCAGCACACG CGGCCTAAGC CgTGTTCCGA ACACGGCGCC GTGCATTCAC ACCCCGCCTG 
TACCGGAAAA AAAAG AG A C G CAGGCGGTGC GGTGTC TTGT TGTCCGCATG CCGCGTTCTG 
CAGGTGCGCG TACCACTGGG AGTGGGAGAG AAAATGGCCG GTTAGCGCTC CGAGTAAGGT 
ACTGCTCACT GCCCCCAGCG CGCAAAATAG CGGGAGTACG TACCACACCC CCGCGCCAAA 
AACGAGCACC CGAGCAAGCG TAAGCTGTAT CACGTTTGAA CAGAACGCAC CCATCACGCT 
TATCCCCACG CACGAAAGGT ACCGGCACGG GACGAACCGC AGCGCATACA TGAGCGCACC 
CGAAGCGGTG CTCCCTGCAA GCGAGAGGAC AAATACATAA GAAAAGAGCG TCCCACTCAC 
CAGAGCCTGC CCTATTACCT TCAGGAATAC TAAACGCGCG TACGCACAGA AAGGGAGCAG 
ATCCGGCGAG ATCAACAGCG GCAAATTCGC AAGCCCCACG CGAAAGAAAG GCAGCGGCTT 
TGGAATGACG TGTTCAACCG TAGAGAGAAA GAAACACATG CCGCCTAAAA GCGACACTAA 
CTCATCGCGT ACGTCTAGTG GCAGCCTGCT CCGCACGAGC CGCACCCTCC CGCGCCGCTC 
CCACAGCCAC CGCCGCTGGT GCTTTCCCGA AACAGAAGTG CAGCTAAGTC ATCGTCCGTC 
GCCTCTCGCA CAGAGCGCAC AGCCACCTCA AAGTGGAGCG TCTTCCCCGC GAGGGgATGA 
TTTCCATCTA CAATAATCGT TTCACCTTGC ACGTCAGTGA CGGTCACCGG TCGACTGTCA 
CCCCCGCTTC CTGCATCAAA CCGCATGCCC ACCTCTATTG GCACGTTTGG AGGAAACTGA 
TCTCGCCCCA CTGTCATGCG CAAGTCCTCC TGCACCTCTC CATACGCTCC TACCGGAGGA 
ATGGTTACTG AAAACTCCTC CCCCTCTTCT CGGTTAATTA AGGCGGTCTC GAGGCCAGGA 
ATGATCATGC CGTGCCCCTG AACATACTCG AGCGCACCCA TCACGTCGGA AGAATCGATG 
ATCTCCCCCt GTCATCTCgC AGGGTGTACT CGATgTTCAC CACACACTCA TTTGCGATTT 
TCATGCGCGG CATG CTAGC A CAGGCAAGAT aCTCACGGCA AGGGCAGTTT CTGTGCCGTG 
TGCCyTTGAc AGAATCGCCG TTATAGGGGA TAAGCCGGGC GAGGTGTTGG GAGCGTGTGG 



960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
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562 


PCT^j 


013041 


TCCACTTCTT 


GCCCTCTTGC 


GCgGTGCTGT GCgGTAAAAG AgGGGGCGTC gCGTTCGAGT 


2700 


AAAATTTTCT 


CTTAAGCCTT 


AAGTGAGATA 


CCCCATTATG GTAGAGGTCt 


AACCGCGGTT 


2760 


GCGCGCTGCT 


GTTGCTCGGT 


TGGCGGTCTG 


TAGCGCTGCG GAGAAGGACG 


GTGCCCTGcC 


2820 


GCTGTGGCGA 


TGCGCTACAT GCGCAGCGGG AGGATATCCT GCGTGCAAAT 


GCGCAGGATC 


2880 


TTGCGCGGGC 


GCGTGAGGCG 


GGTCTTGCCg 


CACCGCTTGT CGCCCGGCTC GCGCTGAGTG 


2940 


AACACCTTCT TGAg GACATG TTGCGGTCTT 


TGaCTGTTCT TTCGCTTCAG 


CGGGATCCTA 


3000 


TCGGGGAAAT 


TATAGAAGGG 


TACACTCTTG 


CGAATGGACT GGAAATCCGG 


AAGGTACGTG 


3060 


TTCCTCTGGG 


GGTGGTGGCT 


GTCATCTACG 


AGTCTCGGCC CAACGTGACC 


GTAGATGCGT 


3120 


TTGCACTTGC 


GTACAAAAGC 


GGCAATGCGG 


TGCTCCTGCG CGCAGGTTCT 


GCAGCGAGTT 


3180 


ATTCAAATGC 


CCCGCTTTTG 


CGCGCAATTC 


ACGTGGGTTT GAAGAAAGCG 


CATGGTGTCG 


3240 


TGGACGCGGT 


GGCTGTTCCT 


CCCGTTTTGG 


AGGAAAAATA TGGTGATGTG 


GATCATATCC 


3300 


TCcGCGCGCG 


CGgCTTTATC 


GATGCGGTAT 


TTCCTCGTGG GGGGGCGGCG 


CTTATCCGGC 


3360 


GCGTCGTGGA 


AGGCGCCCAC 


GTGCCAGTTA 


TTGAAACCGG ATGCGGCGTG 


TGCCACCTAT 


3420 


ACGTAGATGA 


GAGTGCGAAT 


ATCGATGTGG 


CGCTGCAGAT TGCAGAAAAC 


GCGAAGTTGC 


3480 


AAAAACCGGC 


CGCATGCAAT TCAGTCGAAA CGCTGTTGGT GCATCGTGCG 


GTTGCGCGTC 


3540 


CTTTTTTGCA 


CCGTGTACAG 


GAGATTTTTG 


CCACCTGTGA GGAGACTACG 


CGCAAcCCGG 


3600 


TGGTGTGGAT 


TTTTTTTGTG 


ATGC TGAGTC 


TTTCTCCCTT C TC AC AG AAA 


GGGGCGCGAG 


3660 


AAAAAATGTT 


TTTCATGCAC 


AGGCAGAGAC 


CTGGGATCGG GAATACCTGG 


ACTATCAGGT 


3720 


ATCCGTGCGG 


GTGGTGCCAA 


ACCTTGAAGA 


AGCACTCAGG CACATTGCTC 


GTcATTCTAC 


3780 


GAAACACTCA 


GAGGTTATTG 


TCACGCGCGA 


TCGTGCCCGT GCGCGTCGTT 


TTCATCAGGA 


3840 


AGTAGATGCT 


GCCTGTGTAT 


ATGTCAATGC 


TTCAAGTAGG tTTACCGATG 


GAGGGCAGTT 


3900 


TGGCATGGGA 


GCAGAnATTG 


GGGTCAGTAC 


GCAAAAATTG CACGCGCGCG 


GTCCGATGGG 


3960 


TTTGTGTGCA 


CTGACTACTT 


CAAAATATCT 


GATTGATGGA GAGGGGCAGG 


TGCGTCCGTG 


4020 


ATC CGTGCGC 


TTTTTGCTGC 


GGCAAAAAAA 


AtTGTGATAA AGATTGGGTC 


AAATACGCTT 


4080 


GCGCAkGCAG 


ATGGTACTCC 


TGATGAGGAG TTTTTGGCGG wGTGTGCTCG 


CGCCTGTGCG 


4140 


GCGCTGATGC 


GTGACGGCAA 


GCAGATAGTT 


GTGGTGTCGT CTGGCGCTCA 


GGTTGCAGGG 


4200 


ATTTCTGCGC 


TCCATTGCCT 


TTCATCTCCT 


CCTCAGGGGG CGGGTTTAGA 


GCGTCACGAA 


4260 


TCGCGCGGCG 


TTATTCCGGG 


TGATGGTGCG 


TCCTGCAAAC AGGCGTTGTG 


TGCGGTGGGT 


4320 


CAGGCGGAgT TGATAAGTCG 


TtGGCGTTCT 


GCGTTTGCAG CGCACCAGCA GTGCgTGGGC 


4380 
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C AG TTTCTGT GTACGAAGGA GGATTTTACT GACTCGGACC gCGCGGCGCA GGTACGCTAC 4440 

ACGTTGTCCT TTTTGCTCGA GCGCAGGGTA GT AC CTATCC TTAATGAAAA TGACGCGCTC 4500 

TGTTGCAGCG ACGTCCCCTC TGTAmCCGCC GACCGGcGGt GTCCCTATCA CCTCAAAAAA 4560 

GGATTGGAGA TAATGACAGT CTGTCCGCGT TTGTAGCGCT GTTGTGGCAG GCAGATCTTT 4620 

TGCTTTTGTT GAG TG AC ATT GACGGCGTGT ATG AC AAAGA CCCAAAGGCA CACACAGATG 4680 

CGCAgcACGT TCCTCTGGTG ACGGACGTGT CAGCGCTTGT GGGTAAAACG AGCATGGGTT 4740 

CTTCCAATGT CTTTGGTACG GGTGGGATTG CTACAAAGCT GGATGCTGCG CGTCTTGTCA 4 800 

CGAGGGCGGG AATTCCTCTG GTGCTGGCAA ACGGGCGCCA TCTGGATCCG ATCCTGAGCC 4860 

TTATGCGCGG GGATGCGCGG GGGACACTTT TCGTGCCTGT TTCTTAGAGA GCGACGTGGG 4920 

TATGCGCAAG TGCACGCATT GTGCC CTATA ATGCGCGGCG TGCGGTCAAT TTCTGACGTG 4980 

TAATTTTTCT CGGTGGGGCG ACGTCTCCGT CTGTCTGTTA ATTCGGTGGT GTGTTTCGAT 5040 

GCGAGAAAAG GAAGGAGGTG TGGTGAACGA CGATTTTCAC TATGAAGTGA CGCGCAACTG 5100 

GGGCACGCTT TCCACATCGG GGAATGGCTG GTCCC TCGAA CTGAAGTCTA TTTCTTGGAA 5160 

TGGCCGGCCA GAGAAATATG ATATCCGCGC GTGGTCCCCA GACAAGAGCA AGATGGGAAA 5220 

GGGGGTaACg cTTACGCGTG CAGAGATTGT AGCCCTGCGC GATTTACTAA ACAGTATGTC 5280 

CCTGGACCCG TACTAGGGAC AGTCTGCAGT GCTTTGTGCA GcGCGGCGCg cAGc gTCGG t 5340 

GGCTAGCCGG TCGCACAGTT CGTTGTACGG GTCTCCTGCA TGTCCTTTTA CCCAGCGCCA 5400 

CTCGACGGAT AGGGC GTCGG CGAGTGCGCT GAGCGCTTCC CACAAATCCT TGTTCTTGAC 54 60 

CGGTTGTTTG GCAGCCGTTT TCCAGCCGTT GTGTTTCCAG GTATGGATCC ACTGGGTGAT 5520 

GCCTTTGCGT ACGTATTGGG AGTCGGTGAC CACTACCACC GCCTCTGCAG CGCGTCCGTG 5580 

TGCCTCTTGC AGTGCGTTGA TGACCGCGCA CAGTTCCATG CGATTGTTtg TGCTCGGGTA 5640 

GGCgCTGCCG CTTC TAGTGA ATGCGGCAGC TTCTGGTGCG GTTTGTCCGG TTTCTAGAAA 5700 

GGGTACGTCT GAGGGCACCA GAGCAAACGC CCACCCGCCC GGACCCGGGT TTCCCAGACA 5760 

GGCGCCGTCA GTGTACAGGG TAAGTGCAGC GTGCGCGTTC ATAGTCGCGC tACGGTAACA 5820 

GTTTTGCGCC GTGGGGACAA TGTATTGGTC CGACAGTTGG TGATGGAGCG AAGATATTTT 5880 

CGCAAGGAGG GAGAATGAGG CGCGCACGGA TTGTGCAGGA AC TTTGGTAC GCGGGACGAC 5940 

GGTTTGGTTT TTGCGGTACG CTGTCTTATT CTGCAAGGCG GTGTACACGT GCGCGTTGCA 6000 

CTTTCTCCTC GGGTGTACAT GCTGCACTGT TTTTAGAGGA AAGCTAACAC GGAGAGGGCA 6060 

CAGATGAATA TTCTGCATAA CTTTGTTGTA TTCGAAGGTA TTGATGGCAC AGGCACGAGT 6120 
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ACACAGTTGC GTGCGCTCGA ACGCCATTTT CAGGCCCGTA AGGACATGGT CTTTACTCAA 6180 

GAGCCTACCG GAGGGGAGAT TGGCACTCTC ATTCGGGATG TGCTGCAAAA GCGTGTGATC 6240 

ATGAGCTCTA AGGCATTGGG ATTGCTCTTT GCCGCAGATA GACACGAGCA CTTGGAAGGT 6300 

GCAGGAGGCA TTAACGATTG TCTTGCAGAA GGAAAGATAG TGCTCTGCGA TCGGTATGTT 63 60 

TTTTCCAGTT TGGTGTACCA AGGCATGGCG GTGTCGGGTA GTTTCGCGTA TGAATTAAAT 6420 

AAAGAGTTTC CGCTTCCTGA AGTTGTGTTC TATTTTGACG CGCCTATCGA AGTATGTGTT 6480 

GAGCGTATCA CCGCACGTGG GCTGCAAACG GAACTGTATG AGTACACGTC TTTTCAAGAA 6540 

AAGGCGCGCA AGGGGTATGA AACTATATTT CGCaAGTGCC gTCaTTTGTA CCCTGCAATG 6 600 

AAAGTGATTG AAATAGACGC GCGCGAGGAA ATTGAAgTTG TGCATGAGCg TATTCTTCAC 6660 

CATC TGC GCG AATACAGGCG TCTAAAATAG TGTGTGGACG TAGATACACT ATCTGAGGAG 672 0 

CAGTGGAGAG TATATATCAG GAACGTGCTT TGCAAGCGGA AGGCGCGTGC TCGGTAAAAC 67 80 

GGTGCTGCAC CGGCGCAgcA TaAGCAAAAT AATTGGAAAA TTTGTCCATA GGTTTTTGTC 6840 

GTCCGGTCAC AGTGCTCAGT GCCTTTTTCT AGGCTGTTTT TCAATAACTG TTTATGTAGA 6900 

CTGGACGGGT CTTCCTTTCT CAACTCACAT ATTCTTTTCG GGGACATGCT GCCGTTGGCA 6960 

GACGTTGGGT GTGACGGGTG TTTCTCTGGT GTGTAAGAGG AAGATATATT CCCCTTTTGT 7020 

ATCTGCACTG ACCCCTGCAC GGGGTACAGG CTATTGACGC TTCCTTTCGT CTGTGTGTCT 7 080 

TCACTGTTGC GTGTACGGCG CGTGAACGGG CCATATAGAT AGATGCTTGA CGGGGTCTGG 7140 

TTGCCATGTT AGGATCCACC AAGCGTGACT ATTCTTTTCT GGC CGCGTGT GATGCATAAG 72 00 

ACACTCCCAT AGCACCGTTA AGAGTCTCGC GAAACCTCCT CCGTATGGAG AGGGGTAATC 72 60 

CAATTGCCGT GGAACGCGAA GGTTCTGTGT TATGTCCGCA AAGATTTACG TCGGTAATTT 7320 

AAATTATGCC ACCACTGAGG CTGGATTGGC CTCCCTTTTT TCTCAGTTTG GGGAAGTGCT 73 80 

GTCCGTGGCT GTAATCAAGG ATAAGCTTAC GCAGCGGTCG AAGGGCTTTG GTTTTGTTGA 7440 

GATGGAAAGC GCAGAATCAG CCGAGTTGGT TaTTAACGAG TTGAATGAGA AGGAGTTTGA 7500 

AGGGCGTAnG CTTCGCGTTa ACTATGCGGA GGAGAAGCCG CGTTTTcCCT TTaAGAATTA 7560 

GTGGAGGATG GGGAGGACTT TcCATCGTGG CGCATGTTTT TgGCGTAAGG TGCTTTCGCG 7620 

TGCGTTa TCT CATTTcTCGT CGTCTTTTGG TTcTCCCCGT TTGTGTGCGT CGCGGTgTGT 7 680 

TTGGTTcCTG TTaGGAACCC CTTCGGGGcT TCTGTc TATT TTGcTCCCAA GACTGCTAkT 7740 

ACTATGGaTG agGcTGcGTC TCGCGyCCCA GGGTTgyCaw GwAgGGTGCC gTC t TTTGCG 7800 

CCTGGGTTGA AGCAAGGTTT GCCnGGAACG TTGGGTCCGT TGGGTTGAAC CCAAGAAAGA 7860 
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(2) INFORMATION FOR SEQ ID NO: 70: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20682 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

( D ) TOPOLOGY : 1 inear 



565 

7874 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 
GTTATGGTCC CTGTTATTGG CGATTTAAGG ATGCTGCcgT ACGTGCrCTA TCTTA tmCGA 60 

AasTGCgCTG CGATGCgGTT TTTcATGCTG CCGCGTATAA GCaCGTTCCT ATGATGGAAC 120 

TCAATCCTGT TTCAGTGATT GAAAATAATG TCTTCGGCAC CAAATTCTTG CTCGATGCCT 180 

GTATTGCGTG TAGGGTTAAG CGCTTTGTAC TTTTGTCCAC TGACAAGGCC GTGGATCCTG 240 

TTTCTATCTA CGGAGTATCT AAGATGCTCA ACGAGAAGAA TGTCTTGTAT GCTGCTGAGC 300 
GTGTGCGCGA TTTCGGTCAC GATGCCGCGT ATATGTTTGT CCGTTTTGGA AACGTATTGG 3 60 

GTTCCCGTGG TTCTATCATG CCGCTCTTTA TTGAACAAAT AAAGAAAGGG GGGCCCGTTA 420 
CCGTGACAGA TCCTGCCATG ACACGATTCT TTATGACTAT TCCCGAAGCG TGTTCACTCG 480 
TTTTGCAAGT CGGTGGAGTA GGAGTAAATG GAGCGTCGTA TCTTTTGGAC ATGGGGGAGC 540 
CTGTGAGCAT TATGGAGACT GCGCAgcAAC TTATTCGCTA TTTTGGTTAC GAGCCAGACA 600 
GAGATATTCC TATCCACGTG GTGGGCTTGC GTCCTGGCGA GCGTCTCAGT GAGCCACTCG 660 
TTTCCAAAGA CGAGCGTATA GAGCCGACGG TATATCCAAA GGTTCTGCGT TTGCGTGAAC 72 0 

GTGAACCTTT GGATTTTGCG CACCTTGAAC GCCTGTGGGA TCAACTGTAT CCTTACTGTT 780 
TCCCTTCAGG AGAAAAGGTG CGGTACCGGC ACAAAGAAGG ACTTGTCCGC GTGCTATGCG 840 
ACTCGTGCGC GACACTGAAA CAGCGGTATA TGCCAAATAG CGAGGCATAG GAAAATGGAA 900 
GGTACCGTGA AAAAAAAGAA AGAGGGTGTT CGTGATGATA ACGCGCAGCA TGCGGTGTTC 960 

AACAAACAAG TGCCGTTTTT TGTGCCCTCG TTTTCTGAAG CGGAAGAGCG CGCAGTCTGC 1020 

GATGTGTTGC GTTCAGGATG GATTACGACG GGAACACAAG CACTCGCGTT TGAAAAAGAG 1080 

TTTGCckTwT gTGGG tGCTC CCTATGCGTG TGCGGTTAAC TCAGCTACCA GTGGTTTGCT 1140 

TCTC AC CTTT GATGCAATGG GCATTGGGCC GGATAGTAAG ATACTTACCA GTCCTTATAC 1200 

GTTTGTGTCT ACGGCGAGCT CTGCACTCCA CCTAGGTGCG CAGGTGGTGT ACGCCGATAT 12 60 

CGAGCGCGAC TCTTATAATA TCAGTGCAGA GTGTGTTGAA GCGTGTTTAA AAAAGGATGC 1320 
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GCGCATCCGT GCTATTGTAC CCATCCATAT TGCCGGGAAT GTATGCAATA TGCGTGATCT 13 80 

CAATGCTCTT GCGCGTAAGT ATCAAGTGGC AGTGGTGGAA GATGCAGCAC ACGCTTTTCC 1440 

ATCGAAGACT GCGTG TGGGT ATGCAGGCAC ACTGTCACAT GCGGGGGTAT TTTCCTTTTA 1500 

TGCCACCAAG CCGTTAACCA CCGGTGAAGG AGGTATGGTT TGCACAAATG ATGCGAAgcT 1560 

TGcAGCGCGT ATTGCGTGTT TGCGTTCACA TGGCATTGAC CGGGCTATTT GGGATCGGTA 1620 

CACAAATGGC ACCGCACCGT GGCGTTATGA CGTAACAAGC CTTGGGTGGA AGTGTAACCT 1680 

GCCGGATATT TTAGCAGCAA TTGGACGCGT ACAGTTGCAG AAGGCGGCGC ATCTTTTTGC 1740 

ACAACGCGCG CGTATTGCCG CCGCGTTCAC GCGTGCTTTT TCTCGTTATG AATTTTTTTG 1800 

TACTCCGCCT GATGGGGATG GAAACGCGTG GCATTTGTAT TTGTTGCGCT TAGTTCCTGG 1860 

AACGCTTTCT GTTTCTCGGG ACGAGTTCGT CAGATTATTG CAGGAACGGG GATTGGGCGT 1920 

TTCTATGCAT TTTATTCCTC ATTTCGAGAT GACGTTTTTT AAGAAAAGTC TGTGTGTACG 1980 

AGCGGAAGAT TTCCCTGAGT GTGCGCACAA GTATCAGCAC AcGcTTACGC TTCCGTTGTG 2040 

GCCGGGAATG GATGACAGTT GCG TGGCGTA TGTGATAGAG ACCGTGGTGC GCACCGCACA 2100 

AGAATGTGCA AAGGGAAGAG CATATATATG AGCGTGTTCG TTTCAGACGG TGCGCGCACA 2160 

GGGAGCGTCT ATGCACAGCT TGTCCGTGCG CCGCGCGTTG CAGGATTGCT GCTGAACATA 2220 

GATATTCCCT CTCTCCtGAC GGGTACTCTT TTTATACTGC AGCACATATT CCCGGATGCA 2280 

ATGCCGTTCG GTGTGGGGAA AATACTGTGC CGGTTTTTGC GCATGGAGAG GTGGTGTACG 2340 

CAGGG aACCG GTGGGTATCC TCATTGGGCC TGATGAGCAT GTGGTACGTA ATTTAGTGCA 2400 

AGATGTGGTG GTGCATACGT GCGCAGAGCG GGCCTGTGCG TCGGAAATAC TCTGTGGAAT 24 60 

CAGTGAAGGG GAACCCCTCG CTCAAAAGGT GGCGGTGCAA GGAGATGCAG AAACTGCTTT 2520 

TAAACGCGCA TCACACACGG TATGCTCCTC TTGTACATTT GAGCCGCGTG TACACTACTT 2580 

TGCGGAAATG CCAGAAGTAC AGGCACTACC CGACGCGCAC GGTCTGCACG TGTACGCTGC 2 640 

TACGCAtGGc CTGCGCACAT GAGAAAAACT ATCGCGCAGg TACTGAATAT TTCTGAGCAT 2700 

GCGGTGCACG TACATCCGCA GCAGGAAGCG CTTTCCTGTG ATGGGAGAAT ATGGTTCCCC 2760 

TCAGTGATGG CAAGTCAGGC GGCGCTTGCA GCCTATTGTG CGAAAAAGCC GGTACGCTTG 2820 

TCTTTTTCCT TTCAAGAGTA TGTGCAGTAC TGTCCTAAGA CTCCCAAGAT TACCATTGCA 2880 

CATCGCACGG CGCTCAACGC CGCGCATGCG GTAGAAGGTA TGTTTGTTTT TATCTCCCTC 2940 

GATGCAGGAG CGGGGAATTT ATTGATCGAT CGTATGGTTG CGCATATGGT CCATACTGCA 3000 

TTAGGAAATT ATGAAATTCC TCGGTACCGC ATTGAATGCA CAGCGTTTCG TTCAAATGTT 3060 
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GGATTAACGG ATGTTTTTAA TGGATGGGCA GATGCATACA CTTCTAATGC 
CATATTAATC AG TT ATGTGC TGAGCTTCAT ATATTCCCTG ACGAGTGGCG 
ATGAAAGATA CGCGGGAAAC ACAGCGTTTT GCGCGGTTGC TCGCCTATCT 
GGAGATTTTC GTCGAAAGCA CGCAGCCTTC AGCATGGTCA ATGCAGTACG 
GACACCCATG CCTGGCGTGG TATTGGACTC GCGTTGGGGT TTC AATATG A 
ATGTTAGCCC GTTCGGGTTT TTCCTATGTA TTACAAATGA CGCTGCACAC 
ATTGTGGTGC ACAGCGTTCC GCTTTCTGAT TCGTTTAAAC GGGTAGTGGT 
ATCAGAGAGT TTGCGTGTCT GGAGGATGCG ATCTTTTTTA AAAGTAGTGA 
GGCGTGGATC TGTTGGGTCC GTCTGTGGAA TCAGTGGGGA TGAGGGTGTT 
GTGAGAAAGT GTGTACGAGC AATTC AG AG A CAACGCTTCA GAAAGCCACT 
GTACAGGGGT CCTTTAACAC GGCCAAGAAG GGGCAGGTGT ATCAAGTGGT 
AAGTCAGATG TTTCGGTGCC CGATGCGCAA TCTGAGCAGT GTGCCTCAAA 
ACTGCTGATA CTAGCGGAAA ATGTGAGGAT ATGAACGGTT TTACCAAAAT 
AGCACGCACA CTCCTGCAGC CTGTATTATT GAACTCGAAT TAGATGCGTT 
CCTAAGATTG TCAGGTTGTG GTTTGTTTGC GATCCTGGGT ATGTCTTTTG 
GTGTACCGTA CCGTGAGTCG AAGCATTACT CGTGCGCTTT CGCACGTATC 
ATTTGGGAGC GTGCGCGCAC ACCCGAGTAT GTTATCATCG ATCCATCCGA 
TATCACGTCA CC CTTTTG AG TTCAAATGCT GCTGCGCGTG CGGTGGGAAC 
GtATTGTTCC TGctGCGTAC TACGCAGCAC TGCGGCAAAT TTTGCCGATT 
CTACCCATAA GGTTCCTTTT GTTGCGCGGG ATATTTTTTA TGAGATGTTT 
CAGACGATTC TCTATGAATA TTCGC TTTAC GTTGAATACA GAACAGGTAC 
TATGCC tC AT GAGCGTC TTT CGACCGTTTT ACGGAGATGT TTTCATCTTC 
AGGTTCACAC GGACATGGAG AGAATGGCGC GTCTACCATT TTGTTTAATG 
ATCCGCGTAT ATCATACCCT TTTTTCTTGC GCACGAAACA CAGATAGTTA 
TTTTCAAAAG ACTAAAAGAG GACGGTGGAT TGTTGCGTGC TTTGCGCAgC 
CTTTTTGTGG GTATTGCGAT GCAGGAAAAA TTCTCACCGC AGAGAGCTTA 
ATTCTGTGCC CAACGAAGAA GAGGTAAGAC ACGCCTTTTC AGGCATGCAA 
CTGATATCAA TGCGTTGATT CGAGCACTTC AACGTATGCC TGCTTGTCAT 
AAAACTGTAT GCATCTGTAT AAGAACTCAG AATCAAGTGT TATTTATTAC 



PCT 

ATTAGAAATG 
TGTGGCGCAC 
GTGTGAGGAA 
AAAAGCACAT 
TCCGTCTGCG 
TGATGCGCGC 
TGCGTTTCTC 
TGAGGCGTAT 
TGCACGGTTG 
TCCTATCACG 
GACTGTTGCT 
GGTACCTGTG 
GCACGGAATG 
GTGCGTGCAA 
TGAAAAAGAT 
TG TAG AAAAG 
TACTCCTCCC 
GGTTGCCGAA 
TCTCAAAACG 
TCCCTCAGTG 
ATGTGGATGC 
CTTCGATAAA 
GGGAGGCAGT 
CACTTGATTT 
ATgCATATTT 
TTACAAAGGA 
TGCnGGtGTA 
GAGTTTTCCT 
GTAAAAAGTT 



3041 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 

4560 

4620 

4680 

4740 

4800 
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TGGGTCAATT GTGTGCGGTA CTACGTAACG TCGCGCAGGT ACAGCCAGTT GGGGGTGGAA 
CGGGCTTGGT GCAATATCAG ATAACCCCGG TTTTAACGTT GC CTTC AC AT TTGGTTGTTC 
TAAACGGTGT ACCAGAGTTG AAAGATATTT CTAAAACTGA GCACTTTCTT GAGTTTGGTG 
GTGCTGTGTC GTTGCAAGCA ATTGTGCGAT TGGGAAGAAA AAATATTCCC GTGGcACTGC 
ACGAGGCACT GTCGCACGCA GCAAATCCTG GGATACGGAC TCTGGCCACT ATTGGGGGGA 
ATATTGCAGG TACGCGTCCG CATGCTTCTG CTCTTGCGCC GCTTATCGCG CTTGATGCAA 
AAATGGAGGT GCGGACTGGA CATGAAAACT TTTGGATTTC TGTGGCACAC TATGCACATG 
CGCGTTCTGA CAC GCTGCG A CACCGGAGTC ATGTAATTAC CCGTATTCGC CTTCCAACAG 
ATTACTGGGA CTTTTCCTAC TACAGACGTA TTGGGTCGCG TGCATTATTT GGTGAACGTG 
CCGATTTCGT GTTTCTTGCA CAGCAGCAGA AAAACGCGTT GTCTGAAATG CGTATGGTAT 
TTTTTTCAGA TGTAGTAATG AGAAATAGAG AATTTGACAA TTtGCTGTTA GGCAGAGCGA 
TTCCTCTTTC TGCAGGGGAT ATTGCGGCAA TCGTATATCG AAGCAGAGAG TTCTTTGCGC 
CTGAATCCTT TAAGAGTGCG TACATCGCGC ACTGCTTCTT TCATCTGCTG GAAGACTGTT 
TGCGCCGCTT AAGATGAAGC TACAGGTGGC GAGTTTTACC CAGGCACGCG CAAACAGcTG 
ACGCTAAAGA CCGAGTTTTT TCATTTTGCT TTGCAATGTA CTTGGCTTAA GACCAAGAAT 
TTCTGcTGCG CCATTTGCTC CGTATATCTT ACCGTTGCTT GCATCAAGCG CCGCTTGAAT 
TGCTGcGCGT TGCGCCTGAT GAAAGTTGAC CACCAGCGTA GTTTCTTCCT TTTTCCCCTG 
TGTGCATCGC ACCATGCAAG AGCTATCGCG TATGCACAGA GGTATAGCTG GTTGAACGCT 
TTCTGACACT TCGGTGTGCT CCCGAGGATA GACAGTAGTC TGCGTGCCGG ATTCCGGGGT 
CCTACATACA AGGTGTTCTG CGCCGATGGT ATCTCCGCGT GC AAGAAGTG CAGCACGCTC 
GAGTAGGTTG CGTAACTCAC GCACATTGCC AGGAAACGTG AGCGAGAAGA TTTTCTTAAA 
CGCGCTGGGA GAAAGCTGAG TGCGCTCAAA CCCCGGGCGG GTCTTAATTT TTTGGATAAA 
ATGCTCCGCT AGAAGCGCAA CGTCTTCTGC ACgCTCGCGC AAAGGGGGGA GACTGAGGGG 
AAAAACATCG AGCCGGTAGA GGAGGTCTTC CCTGAATTTC CCTTGGGTGA CTGCTTCTGA 
AAGGTTGATA TTCGTAGCTG CAATAATGCG AACCGAGACG CTTACCGAAC GCTCCCCTCC 
AACGCGCTCA AATACTCCGT CTTGGAGTAC GCGGAGGAGC TTCGGTTGCA GTTCCAGGGG 
GAGATCTCCG ACCTCATCGA GAAAAAGGGT GCCACCGTGA GCCAGTTCAA ATCTTCCCCG 
ATGGGTGCCG ACCGCACCTG AGAAGGCACC TTTTTCATGT CCGAATAATT CGCTTTCTGC 
AAGGCTATGG ACGAGTGCTG AGCAATTGAC GGGGACGAAG GGCTTGTCGC TGCGGGTGGA 



4860 
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PCT^B 


^13041 


AAGTTGGTGA 


ACGGTTCGCG 


CAACAAGCTC 


CTTTCCAGTG CCGGTTTCTC 


C AC AAACAAG 


6600 


GACAGGGAGG 


TCAGAGGCTG 


CTACGAGCTT 


TATAGCATCG 


AGTGTGCGTG 


TCCAAGCAGG 


6660 


AGAGGTTCCG 


ATCATATTTT 


TAAATGCAGG 


TGATTGGGGA 


GCTAAGAGCG 


CATTTCGTTC 


6720 


GGTCAAAAGA 


GCGTGACTCT 


TTTGACTCAG 


TGTCTCGGAC 


GCGTCGGTCT 


GGGCTACTGC 


6780 


GAGCGAGATA AGTTTAGAAA GAGTAGTAAT GAAGCGTACA 


ACGTCTGGGG 


TAAACTGCTC 


6840 


GCACAGGCGA 


TGGTCGAGCG 


TGAgCATGCC 


AATGGGAGTA 


TCATCGATGT 


AAAGCGGCGC 


6900 


GATGAGACAG 


GAATGATTTT 


GGGGCATGGG 


AATAAGCTCT 


GTGTAGGTAT 


CAGTGTGGGC 


6960 


AAGCGTCGGA 


TCGAAAAGGT 


ATGGACTCTT 


CTGTGATAGG 


ATGCGCGCAA 


GATCCTGCCT 


7020 


TTTGGTGAGG 


TCTATGgTGT 


GGTGTTGGAG 


GCGGGGAGTG TAC AGGGG AC 


CACGCGCCTT 


7080 


GCGAACTCGC 


AGTATTTGAG 


AAGATTCAAA GCTGAGGACC ACGGCTAGCT CgTAACGGGC 


7140 


AATCTCATAG 


AGGcGTCCAG 


AATCATTtCC 


AGCGACTTTT 


CCGCAGcAGG 


GGGAGAGCGC 


7200 


GCGTGCAGGA 


CAGCCCGAAC 


AgTTCATGGG 


GCCCGAGTAT 


AGAAGAAAAA 


gGCATATCCG 


7260 


TGCAATTcTC 


CGCATGGAGC 


CTGTGGGCGT 


GTCGTGTGCA 


GGgGTATGGT 


ATTTGTTTTT 


7320 


CGAATCCTTT 


TCCtCGCGTT 


TTTTATGGGG 


TATAATCGCG 


CGCATGAGAC 


GCGTGTGGAT 


7380 


AAGTGTTCTG 


ATGTTTCCTT 


GCGTATGGGC 


AAATGCGCAG 


GGAGAATTTC 


TCGCAGGCGG 


7440 


yGCAAAGGGA 


TTGTACCGTA 


TTACTCCTTA 


CGCTCAAGAC 


GTACTGCTCT 


CTGGCGTTTC 


7500 


GGTTAGCAAG 


ATTATTGCTG 


CGGGAGAGAA 


CTGGTTCTTG 


CTTACGTCTC 


GAGGTGTCAT 


7560 


GACCTCGCGC 


G AC TTAAGG A 


CTTTCGCGCA 


CGTGGGTGAG 


CAACTACCAA 


AGAAGGTAGT 


7620 


GAAGAAGATA 


GTCGATCGGG 


AAAAGGTTTT 


TGTGTCTCAG 


CCGCAGCCAT 


TGAAAGATCT 


7680 


TGAGGTACAT 


CCGGATAACG 


GAGCGGTTTT 


GGTTACCGCT 


ACCAATGACG 


CAGTGTTTCT 


7740 


CAGCAAAAAT 


GGGGGACGGA 


CTTGGCAAAA 


TCTGGGCTGT 


AATGCAAACA 


GCAGTGGGAT 


7800 


TAAGGCGGTc 


GCGGTGCTCG 


ATTTTCCTGA 


TGAAACGGGT AAGCCAGTGC 


TTACCGTGTT 


7860 


TGTTTCGCAT 


TCCCTGCGTG 


GTATTGCGTG 


GATGCAGCCA 


GAGAAAGGTC 


GTTTTTGGAC 


7920 


TGATATTnAn 


GCcTwCnCTT 


GCGCTTGGTC 


CTGAAGCCAC 


TGAAGAAATC 


TCAGACATTG 


7980 


CGGTGCGCAG 


GAGCGTGCAT 


GGCAATGAGC 


TTTTTGCAAG 


CTACACGTTC 


GTGCCCAAGA 


8040 


TCGTACGCCT 


TAACTGGGCC 


AAAAAACGCT 


TTCAGGACGT ACGTGTGTGG 


AnCGntGCGC 


8100 


TGAAAGATGC 


GCGCTGCATT 


GATGGATTGA 


GTGCGTCTGn 


CnTTCGCTCG 


TTGGGTGTCG 


8160 


GGATGGTAGT 


TTGTTTGAGA 


TCCCCCTCAT 


TATGCCTCGC 


CCCTTCGATT 


TGGCGCGTCT 


8220 


TGAACAGGAT 


TTGCGTCGGA 


TCCCGGATCA 


AATCTTATGT 


GCGTGGGTTC 


CGCGTCATGT 


8280 
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GTCACAGACG GGTGATGCGC 
TCTTGCAAAA GAGGGACGTT 
GCATCACATT AAGGATCCTA 
ACTTAATATG CTTGTGTACG 
AGATCCATTT GTGCGATCGG 
GAAGCAGGCA AAGGAAAAAA 
GTATTTGTTC CGTTGGAATG 
GGGGTATAAA AACGGAGCGC 
CAACGAAAAG GTGTGGCGGT 
CGATGAGGTA CAGTTCGATT 
GGAGTATCCG GCAAGAGAGT 
GTACGCGCGC GAGCaTaGAC 
AC CGTAC AGG CGCGCGCACG 
TC TGCCCG AT GTTCTACCCC 
AGGAGCGTCC CTATCGCATC 
CCGGGTGGTC ATCCGACCCT 
TACTACGGCG AGGATTACGT 
GGATACACGT ACTGGAACAA 
CTCCGTTAGC CGCCAAGGCA 
ACAGGACCGA AAATCAGCGT 
CCGCAGTCCT AAAAAGCTCG 
CTATACCGAA ATCCGAGACC 
TTTTCTACAT CTGGCAGACC 
CCCTACCACA GCGTTGACAA 
CCACCCCCCC CCTGCACCGA 
GCACACGCAA AAGACCGCTA 
TCAAGGATCT CCCGACGTTT 
TGCAGGTGCG GACCAGTGAC 
TCTTGATCCA AAGAGTCGAA 



TTGCGCGGGC 
AGCTTCGTGC 
ACATGAAGGA 
TCGGTGCAGT 
AGTTGTACCT 
GCTTTGAGCT 
TCCGTAAGGA 
ACAATGTCGC 
ATATTCGGTT 
CCGGGATGGA 
GCGCCAATCT 
GGCCAGGACG 
AGCCACTTTA 
TACTACTACG 
GGGTGCAGGC 
GCAGCGCCAG 
CTCAGGGCGT 
ACCCGGCGAG 
CCGTCCTCTC 
CTGCTGTCCT 
AACTTGGCCC 
ACTGCAATAC 
CCATAGCAAA 
TACCGCACAA 
CTCGGTGAGG 
C TC AG AAC TC 
GTCAAGTGGA 
GTCAACCGGA 



570 

TGAGTTATGG 
AGATTTGAAA 
AATGCACTTC 
TGAGCTTGGA 
TCGTCCTTTT 
AATAGCACGT 
TGCGGTTAAG 
AGAAATTCAC 
CATCGCGAAA 
CCCTACCGAC 
TAGGGAGAGT 
CCATTGATAT 
TGGAACTGCT 
GCCAGAGCTT 
GCGGTACCGC 
CTTCTAC t AC 
GTGGCTGGCA 
TACTCAGACG 
CGGACGCCCC 
AGGAGAAkmA 
CGTCAAGGAT 
CTTCGGGGGG 
TCGCGCAGGC 
CTCCCTCCGT 
GCTACGCGAG 
ATTTTAATAA 
TTAAACAGGA 
GAAAGGGTCA 
CTGTCAGAAA 



CTTTTGCACG 
AAGGGTATCT 
AAAACGATTG 
ATGGTGCGTT 
GTTGATATGA 
ATTGTGGTGT 
GCGGGTGGTA 
GAGCATTGGG 
GAAGCAATTG 
GGAGATAACC 
GCGTTGATGT 
CTACGGAGCG 
AGCTGAGTAT 
CCTAGCTTAC 
AAC CGGGTGC 
CGGTCTCTTA 
TCCGCGAATC 
TCCGGCCCGA 
TTGTTGGGCC 
CTTCTCTAGC 
AAC AAC C ATT 
AAGTCGATTC 
GCGCGCT tCG 
TTCCGTCAGT 
TTC AG TC AAC 
CCTCCGCTTT 
GACTGATCTC 
TTATGAAAAT 
CACCAACCCC 



ACCGGTCTAG 
ACGTACCGGC 
CGGACAATAA 
ATCAGTCGCA 
AGACGTTTGT 
TTAAGGACAA 
AGCCTTGGCA 
TGGATCCGTA 
AGTTTGGCTT 
TTCACCAGGC 
CGTTCCTGGC 
AACGGGTGGT 
GTGGATGTGA 
GCGCCTGCGC 
TGGCCCGCAA 
CGACCGGGCG 
AATCGATGAA 
CGGCGCGCGC 
GTTTTCCCAC 
TCGCCTATGT 
GGAAC GCT AC 
AGCAATCTTT 
CCTCGTCTGC 
CTGGGGCTTA 
TCGGCGCACA 
ACTTGTTGCA 
AGCCAAGAAC 
ACGACAAGGT 
TGCAACCAGA 
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8340 
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PCT^P 


1/13041 


CTACTCACTG 


AATTAGTTTT 


ACCGTGGGGT 


ATGGCAATGC 


CATGCTTCAT 


CCCCGTAGAC 


10080 


ATTTTTCGCT 


CTCGATCGAG 


CACACACTCC 


CGCGCAgcAA 


CCTTGTCGCT 


CACCCTTCCT 


10140 


GCACGGACGA 


ACATCTCGAG 


CATTTCGTCG 


ATGATCTCCT 


CCTTGGTAGA 


ACCCTTCAGG 


10200 


TGCAGGCTTA 


CGGTTTCCGG 


CGTCAACACG 


GTCTCCAAAT 


TCATTCCCCC 


AAGCTAAAAA 


10260 


CTCTCAAAGG 


AAAAGTCAAG 


CTTTTTGGAA 


AAGCTCCCCA 


CGCTCTTTCC 


GCTCCCAGGA 


10320 


TATCTTGACC 


TTTTGCCTAC 


TTGGACTTTA 


CCATGCGCGC 


GTGGAGTTCG 


CCACCAGGGA 


10380 


GCAGCTGAAT 


AGGTACTACG 


ATTTGTACAA 


GGATGTCGAT 


GTAACTTTCT 


CAAAGGATGT 


10440 


GATGCAGGCG 


CTCTGTTTTA 


ATGCGCGGCA 


GGTGTGCGTG 


CGAACGCGGn 


GAGGTCAGTG 


10500 


TTCCTGCGTA 


ATGAATTCTG 


TGTCTATGGT 


GGGTGCGAAA 


GTTATTCTCA 


GCAGGAAGAG 


10560 


TAGTCTGCTT 


GAGAGCATTC AAGTGGAAGG GGCGAGCGTC AgCATACGGT 


TTTCTTTCTT 


10620 


TGAGTCCGAT 


GCGCGGGATG 


CGGTTTCCTT 


CTTCGTTACT 


GCCAGGGTTC 


TCGGTGTTGA 


10680 


AGACTATGCC 


CaGAGTACGG 


AGCTAGTGGT 


GTTAAGCGTG 


GCGTATACGC 


AGCGCATACC 


10740 


TGATATGCTC 


ATAGAGCGTT 


TGGGTTTGCT 


TGTTGAGGCC 


AAC ATT AG TT 


CCAAGAAGCG 


10800 


TAAGTCGGAG 


CGTATTGCGg 


TGAACAAGGA 


GAGTATGCGC 


AGGATCGGCT 


TGATGAGAGC 


10860 


GGAGACCATC 


GTGTTCATTC 


AGGCGATTCC 


TCGCCGCTGC 


GTTCTGCGGG 


ATGTTTCCTT 


10920 


TGGTGGTGCG 


AAGTTTATCA 


TGATGGGCGT 


TGCGCCGTTT 


TTGAAAGGCA 


AGGAGACGGT 


10980 


GCTGAAGCTT 


GATTTTGAGG 


AGCCGAGTAC 


GAGCATGAGT 


ATTAGGGGGC 


ACGTGGTGCG 


11040 


TGCAGATCAG 


GTTGAGGGGC 


GTAAAGACCT 


GGTGGCCGTG 


GCCATGGAGT 


ACGACTTTGA 


11100 


TGTGGTGCCT 


GTCGCGTATC 


GTATGTGTTT GAACCgsTAC 


GCATCGGACC 


gCTGTCGCCG 


11160 


TTTTCCCGGT 


ACGGACGAGG 


ACTGCTCTGC 


GGCGTCTGCC 


GGCGATCCAG 


GGCGGTCGTC 


11220 


AGCAGGCGCT 


GAAGGTATTG 


ACCTTTCTGT 


ACCCTTCTCT 


TTGTCTTAGT 


TTTAATGGCG 


11280 


CTTGTCACCG 


GATTGCCCCT 


TAGGGGGGTC 


CGCATGTCTG 


CGGATGCGCG 


CGAGGCTCCG 


11340 


TGCACTGAGC 


CTCGCTCCGA 


CCAGTAGTTT 


CGAACGCACA 


AACCAGCCTC 


GTgGCACTTC 


11400 


CAGTGCGTAC 


CGTACTG AC C 


GGTTGCTGCG 


TACGCTGTGT 


GTGCTCAGCG 


GGACTAAGTC 


11460 


TTCTATCTGT 


ACGATCGCCC 


CTTGAGCATC 


CAGGAATGCG 


AGAGAGAgCG 


GGTGTGGGGT 


11520 


GTCCTTCATC 


CAAAaGGAGA 


GGCGTGTGTC 


CTGTTTATAC 


ACGAAAaGCA 


TGCCgTCCcG 


11580 


TCGGGGATCC 


GTGTACGCCC 


CATGTACtCg 


CGnCCTGCGC 


TTCTTCCGTG 


AGTGCGAGTT 


11640 


CTACAACCAC 


CGGCACGTAC 


TGCCCTCCTG 


TACAAAAAGC 


GATTTGCGCT 


GTTTCTAGGC 


11700 


TATTCGTTCT 


GCACGCCACA 


CAGGAAAGCA ACCCCAGTAA 


C AAAAGCG AC 


AGCGCa gCAG 


11760 
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572 




PCT«I 


f/13041 


GTGTCCTGTG 


CAGGTAGAAG 


GAGACGTGCT 


CTTCAAAAGC 


CTTGTACAAA 


ACGCTTTCGA 


11820 


TCCTTTTCAG 


CATTGCTCTT 


TTGAGTGTCG 


CGTGCACTAA 


GCTGTTCGTT 


AAAGGCGCGG 


11880 


ATATCTATGT 


ACTTGATAAT 


AAGAGGCCGC 


TC GAGC ACTA 


GGCGTGTGGA 


CGCGTCTTCC 


11940 


CACAAGGCAA 


CCCGAGGACT 


GAG CTTATG A 


GGTTCGCCGT 


ATTTTTTACA 


CAAGTTCTCA 


12000 


TATACCGAGT 


AGTAATCTAT 


CGCGTCAGTG 


TTGAGTTTGA 


ACGTCATAGC 


ATATAGACGG 


12060 


TCACGGTAGA ACTGAAACCA GCTGCGTGCA ATAAAGTGGG 


GACCCGTGGT 


CTCAATCAAG 


12120 


ATACGGTTCT 


CACTCATTAG 


CAGTGACACG 


TCACGCTCGC 


CGCGATATCC 


AAAAATACTA 


12180 


TC C TTTTTC A 


GTGCTTCCTT 


TACCTCGGTT 


ACGCCCATAC 


CCAAACTCAG 


CGCGCGGTAC 


12240 


ACGGCGGGAA 


TTTTCTCAAG 


AGAATGTGTC 


TGCGAAAGAG 


GAGTGACTGT 


AGCCAGCATT 


12300 


CTCGCACCGC 


CTGCGCCGCG 


ACTAGGGGAA 


GGGGAAGAAC 


ATAAATAAGA 


CTGCATATGC 


12360 


AACACCCCGC 


CCAACGCAAG 


CGCCATACAA 


TTTTTAGCAT 


AATTCATGCC 


GTTTTCCTCT 


12420 


CCTTGTGTGG 


ATACATCTAC 


TGCCTGTTGT 


GTGGGTCCTT 


GCGTGCCTTG 


GCGAGGGTAC 


12480 


GCTTTTTCTT 


AAAATAGGCA 


ACGCGCTTCA 


CCAATTCTAG 


GTCGTCGGTG 


ACAGATATTT 


12540 


TGTTATCGAC 


ACAGCGGACA 


ATAGGTTCTC 


G T AAAAAgTC 


TtCGGTTACC 


TGAGAGACGA 


12600 


TGTCTTTAGG 


GAAACCGCAC 


ATGTGTGCGA 


GTTCTATGGG 


ACCGAATTCA 


AAATCGTAGG 


12660 


CCTTGCCGGT 


GCTAGGTATG 


TAGCGAATCT 


TTTC TAACTG 


GATGGCCAAC 


ATATCGTACA 


12720 


TTTTTTCAGT 


CGGTTCCGGA 


AGCAAAGTAT 


TTGCGAGCTG 


TCGGTACATC 


GACCAGATGC 


12780 


GATCTGCGAG 


CGTGGTGGTA 


AGACGCGCAg 


TCAATTGCGG 


TTGTGTGGCT 


ACCAGCTGTT 


12840 


GGAAGTTCTT 


TCGGTTCACG 


GCCAAAAGCT 


GGCAACCATC 


AGACATAACA 


ATGGCGCTTG 


12900 


CAGAACGCGG 


CTTGTTCTCC 


AGCAACGCCA 


TTTCCCCAAA 


CATATCTCCT 


TCTTTTAAAA 


12960 


TCGCCAGCAC 


TACCTCATTG 


TTATCAACAA 


TCTTAGTAAT 


TTTTACATGT 


CCTTTTTGAA 


13020 


TGATGTAAAA 


CTCATTTCCC 


AATTGACACT 


CACAGAACAC 


CATCGCCTCT 


CGATCGTAGC 


13080 


AGCGCGTGGC 


TTC AAGTATG 


TTAGGTTCGA 


GTATTTCTAC 


TGGTACCTTA 


ACTCC TGTGG 


13140 


ATTTAATCGC 


AACAAATCGT 


TTGCGTGCTT 


CCTCTGCATA 


CGTCCCCTTG 


GGACTTTCCT 


13200 


TGAGATAATG 


ATAGTACGCA 


TAGAGCGCAA 


GTTCAAACTT 


CGTCATTTTG 


ACGTAGTATT 


13260 


CGCCAATAGC 


GAAAAGATGC 


GAGACATCCA 


CATCAGTGTG 


TTTTTTCAAT 


GTCAATTGGG 


13320 


TAAGCGCCTC 


ATTGAGGTAG 


CGCATTTTCT 


TTGTGAAGGA AAGGATGATT 


TTC A T ACST* A A 




TCGCCGCGTT 


CTTTTCAATG 


AGCTGGGGGA 


ACTGCTCATA 


ACGAATTGCA 


ATAAGCACGA 


13440 


CATCAGTGAG 


CGCAACTGCA 


GTTTCAATCT 


GATTATGCCG 


CGACATGCAG 


GCAACTACAC 


13500 
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GCGTCACCTA CGTTATGAAT ATT AC CGACG 
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CTTCCTCCTC TGCAACTATC TCTACTTGTT 13 560 

TATAGAAAAG ATCTGCGTCA gcTTTCCCCT 13620 

TAACAAACGT CAGCTGTAAC AAAGTATCCC 13 680 

TTTACAAGAC AACCCGTACG TCTTGCAGTG 13740 

CTGGAAGTCA TCCTCACAGA TTCTCCAGAA 13800 

TGTGTTTCAA ACACGCGTTG CCGAACGACG 13860 

GCATTCCCCG TATTGATAAT GAGATTTCCA 13920 

GAGGTGCCCC GTAAACCTGC CTCTTCTATG 13980 

TTGTTTTTAA ACGCGCTGCC TGCTGACGGA 14040 

GCAATCTTC T CCTGCATGTG CTTCCTAATC 14100 

CACAGCGAGA GGATAAGACG CCTTCCTGCA 14160 

GGAGAGCGCT TGTAGCCCCA ATCCCCGCGC 14220 

TAAACGGTCC GCCGTCGAGG CCAAGACATT 14280 

CCTCTGGCAG TTCTTTTGCG CGCGAACGcA 14340 

GAAAnCAAtC TGCGATTGCA CGCCCATAAC 14400 

CACTACCAGG CAGCCCTGCA AAGGTCTCAA 144 60 

CCAGGAGGGC GGCCACAGGT AACCCCGCGC 14520 

GTGTTTGGGT GTGTAGACTG CGAAAgCGAC 14580 

c G TCTGCG AT TAACACGTTA GAGCCTCCCC 14 640 

GCGCTTcCTC AATAAGCGCG CGCAgcTGTG 14700 

CAGCgCaCCA ATGCGGAAAG AACATCGCTC 147 60 

ACGCGCGCGT ATCCGGTGCG CGGACATGGA 14820 

GCTACAATGG CGCGGTCTTG GCCTTCTTTG 14880 

GCGGGCAAAG GCGTTTGCGC GACAATGGGG 14940 

GGTTGGGAGA AACTTCGAAG GGGCGCACAT 15000 

TCAGCAAGAG CACTTTCAAC CCTGGGAGCA 15060 

CTACGGTGTA CAATTATCCC CATCTGGGGA 15120 

TTCGACGTAC CTTGCACTTT CTTGGATACC 15180 

TTGGGCATTT AGAAAGTGAC GCAGACAGTG 15240 
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GTGAGGATAA 


GCTGGTAAGG 


AGCGCACAGG 


CGCATGGCCA 


CTCGGTGTTG 


CAGGTTGCAG 


15300 


CGCACTATCG 


CGC Ac CTTTT 


TCCGCGATAC 


TGCACTGCTC 


GGTATTGAAG 


AGCCGTCCAT 


15360 


TGTCTGTAAT 


GCCaGCGATT 


GTATCCAGGA 


TATGATCGCG 


TTTATCGAGC 


AATTGCTCGC 


15420 


GCGTGGGCAC 


GCGTACTGTG 


CAGGAGGGAA CGTGTATTTT 


GATGTGCGAT 


CCTTTCCTAG 


15480 


CTACGAAAGC 


TTCGGTTCTG 


CCGCGGTAGA AGATGTTCAG 


GAAGGAGAGG 


ATGCGGCGCG 


15540 


CGCGCGGtGG 


CACACGATAC 


GCATAAGCtG 


ATGCACGTGA 


TTTTGTGCtG 


TGGTTTACCC 


15600 


GTAGTAAATT 


tGTGCGTCAT 


GCGTTGACGT 


GGGATtCTCC 


GTGGGGGCGG 


GGGTACCCCG 


15660 


GTGGCACATC 


GGGTGTTCTG 


CAATGAGCAT 


G AAGTTTT T A 


GGACCACGTT 


GCGACATCCA 


15720 


CATCGGAGGG 


GTGGATCATA 


TTCGTGTGCA 


TCACCGTAAC 


GAGCGTGCTC 


AGTGTGAAGC 


15780 


AATTACTGGT 


GCACCCTGGG 


TGAGGTACTG 


GTTACACCAC 


GAGTTCTTGC 


TGATGCAGCT 


15840 


GCAAAAGCGC 


GCAGTACATG 


CGGATATGGG 


CAGTTCGgTG 


GTGTCGTCTT 


TTTCTAAAAT 


15900 


GTCCAAGTCC 


TGTGGGCAGT 


TTTTGACGCT 


TTCTTCGCTG 


CAGGAgCGTG 


CTTTCAGCCA 


15960 


GCTGATTTTC 


GCTTCTTTTT 


GTTGAGTGGA 


CAGTATCGCA 


CGCAACTTGC 


TTTTTCTTGG 


16020 


GATGCGCTAA 


AAACGGCGCG 


TGCCGCCCGA 


CGGAGTTTTG 


TGCGGCGAGT 


GGCGCGTGTA 


16080 


GTGGACGCTG 


CTCGAGCAAC 


TACAGGCAGC 


GTGCGCGGCA 


CTAGTGCAGA 


GTGTGCCGCA 


16140 


GAAAGGGTGT 


GTGAATCGCG 


CGCATCAGAA 


TCTGAGCTGC 


TCTTAACTGA 


CTTTCGTGCT 


16200 


GCGTTGGAGG 


ATGACTTTTC 


TACGCCACGT 


GCTCTGAGCG 


CCTTACAAAA 


ATTGGTGCGT 


16260 


GATACCTCGG 


TGCCGCCATC 


GCTGTGTGTT 


TCGGCACTCC 


AGGTGGCGGA 


TACAGTGCTA 


16320 


GGGTTAGGCA 


TAATACAGGA 


AGCGACCGCA 


TCGCTATCTG 


CGCAGGTTCC 


TGCTGGCGAT 


16380 


ACGTTGCCGC 


AGCGTCCTTT 


ACCGAGTGAG 


GAGTGGATTG 


GACAGTTGGT 


GCGTGCGCGT 


16440 


GCACATGCAC 


GCCAAACGCG 


TGATTTTCCC 


CGTGCAGATG 


AGATCCGTCG 


GCAGTTGAAG 


16500 


GCTGAAGGGA 


TTGAACTTGA 


AGACACCCAT 


CTTGGGACTA 


TTTGGAAGCG 


CGTGTAACAT 


16560 


TTTGGGAGAT 


ACATTGTTGC 


ATGAGCAGGA 


GCTTTTAAGA 


GCACAGGATG 


ATGCAGATTT 


16620 


TAAGCTCATG 


TACGAGCAGC 


TTGTGCCAGT 


GCTCTAsCGC 


GTAGctAcAA 


CGTGGTGCGC 


16680 


GAGGAGGACA 


TCGCTGAGGG 


GCTCTGCCaT 


GATGCCTTCA 


TTGCAtGACA 


GAAAAGAGGA 


16740 


TGGAgTTTCC GTCTCTGTCG 


GACGCAAAGT 


ATTGGTTGAT 


CCGCGTGGTG 


AAAAATGCCT 


16800 


CGTTAAATTA 


CGCTAAGCGT 


CGTGTACGTG 


AGCGTCATTC 


T t GTGAGCAA 


GCGTCGCGCG 


16860 


AGCATGTGTG 


CGAGCCGGAT ACCGGTgrmT 


TCGCTTGTTA AGAATAGAGA CGATTGAGCA 


16920 


GGTGCGCGCG 


GCCTTAGATC 


GACTGCCCGA GCACCTCCGT 


GTGGTTTTGC 


AGTTGCGCGA 


16980 
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GTATGGGGAC 


TTAAACTACA 


AGGAGATCGG ACGTATCCTG 


GGCATCAGCG 


AGGGGAATGT 


17040 


AAAGGTGAGG 


GTGTTCAGAG 


CGCGCGAACG 


ATTAGCGAAG 


TATTTAGGAG 


AGACGGATGC 


17100 


GTACCTGTCC 


TGATTGTGCT 


GCTTGGTGTG 


CTTATGTGGA 


CGGAGAAGGT 


TCGCAACTGC 


17160 


AACGCCGTGA 


GATGTGCGCG 


CATCTGCAGG 


GTTGCACACA 


CTGTGCCACG 


TGTGTGGCGC 


17220 


ACTATCGCGC 


CATGCGGAGT 


CTTGTCAAGC 


ATGCTGATCG 


CGTTTCTTCC 


CGTGATTTTA 


17280 


CAATGGCTTT 


TCCATATTTG 


CGCGTGCGTC 


ACCGTGTCGC 


TTC CTG TATG 


CCGAGGCCGT 


17340 


GGTGGCAGGC 


ACGTTCCTCT 


CCTCTTTCTG 


CTGCAGGACC 


GGTCCgTGCT 


GCGGCACTCG 


17400 


CTGTGGCGGT 


CGCATCTTTA 


TGTGTATGCA 


CCCTGTTGCT 


TACTCATATT 


GTTGAAAGGC 


17460 


GTCCTGTATC 


CCGTGCGGGT 


GAGGCGAGTT 


TTACCCCCAT 


TGTACCTATG 


CGTGTTCGCG 


17520 


CCCCTGTTGG 


GTACGCGCGC 


GGTGTGAAAG 


TGTTTGGTCC 


TGCCGTTAGT 


GCGAATTCCA 


17580 


ACGTgTGCGC 


AAAC CAGCTG 


CGGTGTTCAC 


CGTCTGTGCG 


TTTGCGCAGT 


TGTATGGCTC 


17640 


AGATC CTGCG 


TATGAAATGG 


AAACAGTGCC 


GGTGAGGCTA 


TCGGTTATTC 


CTGTGCCTTC 


17700 


C TATG TGCTC 


AATGCTTCAA 


AAGCGCAGTT 


CTTTTCCCCA 


TAATCCAGGC 


AAATGTGTAG 


17760 


TAAAAATAAT 


GCGCCCGCGC 


GGACGTGTTT 


CCTGTTCTTT 


TCAAACCGTT 


CTGAtCGTTG 


17820 


GGTGTTCCTG 


TCTGCAAACT 


TGATTGACCT 


GCTTGTCAGG 


TAGCCATAAG 


GAGAATGTCT 


17880 


ATGACCTTCG 


TTGAATCAAT 


GCAGCGGCGT 


GCTGTGcTTG 


CGCAAAAACG 


ACTCGTGCTT 


17940 


CCTGAGGCCT 


GCGAGCAGCG 


TACGCTCGAA 


GCCGCCCGTT 


TGATTGTGTT 


CAGAAACATA 


18000 


GCCGCAAAAG 


TTTTTCTTGT 


CGGATGCGAG 


CGTGATATCA 


AAAACACCGC 


AGACAGGTGC 


18060 


GGTATCGACC 


TTACCGACAT 


GGTCGTCATC 


GATCCGAGCG TTAgCAAGCA CAGAGATCAG 


18120 


TTCGCAGAAC 


GTTATTTTCA 


GAAGCGAAAA 


CACAAAGGAA 


TAAGTCTTGC 


CCAGGCTGCA 


18180 


GAGGATATGC 


GCGATCCTCT 


GCGTTTCGCT 


GCTATGATGC 


TTGACCAAGG 


TCACGCAGAT 


18240 


GCCATGGTTG 


CCGGTGCAGA 


AAACACTACC 


GCGCGCGTTC 


TTCGTGCAGG 


CCTCACCATC 


18300 


ATCGGAACCC 


TTC CGAGTGT 


TAAAACTGCC 


TCTTCCTGCT 


TCGTTATGGA 


TACTAATAAC 


18360 


CCCCGTCTGG 


GAGGAACACG 


TGGTCTATTT 


ATTTTTTCAG 


ACTGTGCAGT 


GATCCCCACT 


18420 


CCCACCGCAG 


AACAGTTGGC 


TGATATCGCC 


TGC TCTGCTG 


CAGAAAGCTG 


CCGCACCTTC 


18480 


ATTGGAGAGG 


AACCGACTGT 


CGCACTTCTT 


TCCTACTCTA 


CTAAAGGATC 


AGGAGGTGAT 


18540 


AGTGACGAGA 


ATATC CTGCG 


TGTACGTGAG 


GCAGTCAGGA 


TTCTACACGA ACGGCGGGTG 


18600 


GACTTTACCT TCGATGGGGA ATTGCAGCTC GATgcTGCGC 


TCGTACCTAA 


GATTACCGAA 


18660 


AAAAAAGCGC 


CTCACAGTCC 


TATTACGGGA AAGGTGAACA 


CACTCGTGTT 


TCCCGATCTT 


18720 
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TCTTCGGGTA ATATTGGGTA CAAGCTTGTC CAGCGCCTTT CAGATGCGGA TGCATACGGA 
CCTTTCCTGC AAGtTTTGCA AAACCACTGT CTGATCTCTC GCGTGGGTGC TCGGTTGAAG 
ATATCGTCGC CGCTTGTGCA GTCACACTTG TGCAATCGAA TGGACGCTAA TGACGTCC AC 
CCAGGCGCGT ATACGTGAGG CAGTCCGTGC AGGGAGCGTC CGAGATTATG CGCGTGCTAT 
CCGTATTCTT GAAGAGCTTG CCGCTTCAGG AAAGGCAGAA GGATGTCATC ACCCAGATGG 
CGGTGCGGTG TATGAGAGGG GGGCACAGGA AGAGTGGAAT GAGGGGTCGT CTGAGTCGCA 
CGCGCACGGT GGGGATGGTA CGCAGGACGC GTATCCTGAG ATTTATTTGT ATCTTGCGCG 
TGCATACCAC GCACAAAGGC AGTATGCGCG CGCGGTAgTA ACGCTACTGT GTATTCTAGG 
CGCGTGCCGC gcGrACGGCG CAGGTTGGTT CTTTTTGGGA AGGAGCTATC TTGCACTGCA 
TCAGGGGGGG TATGCGGTTG CAGCGCTTCG GCGCAGTGTA CGAGAAAATC CTGCCTCTCT 
TGGGGCGCAG GCGCTGTTAG GACTCGCCTA TCTGCGGAGT AAGAAGCCGC GTGCAGCGCG 
CATGGTGTTT GAGCAAGCAC TTGCGCAGTA TCCAGACAAT AAGCGTTTGA ACGCAGGGTA 
TTTGAATTCG CTTTTTGTAG AAGCAGTGCA GCATCTAAAA CGGGGGAGCG CAGATCTTGC 
GCGTCAGATG TTTACGTTTC TGATTAATCA GGATGTAGAC GGGGTTGCGC CACGTTTATA 
C TTGGCGC AC GCGTTTCGTT CTTTGAAACA TTTTCCTGAA GCGCTTACCC AGTATCGTGC 
AGCAAGCGCA TTTGCGCCGC ACGATCCTGC CCTCAAGTGG TACGAAGCGG CCATGCTTGT 
AGAAATGGGG TGTCTGTCGC AGGCGGCAGC GTTGCTGTCG ACGTTGGGTG TTTCCATCGA 
GCGTGATCAG ATTTCGGATC GTTTTCTAGT GATGGGCGCC GTGCGCAAGC ACATGGAGGA 
GGGGGCGTGG GCTCGTGCCG CTTCTGCAGC GCATTTATAC CTGAAAACTT TTGGGGGTTC 
TGTAGAAATT CACCTGCTAA TGGCAGAGGT TCACCGGCGT GCGGGGCGCG TGAACGTGGC 
TTTGAACCAC TACACGCGTG CGATGAAAAT AGAACCGAAA AATTGTTATC CGCATTATGG 
TCTTATGGTG TGTTTGCAGG AAGCGAGGCG CTGGCAAGAG CTGGCAAAGG CAATCAGACG 
TGCAGAAGGC GCAGGGTGCG ACGCGCAGGA TTGCTACTAC TACCGGGTGA TTACAGCTGC 
CCATTTGAGC AATCtCCCGA GGAGGTGTTA CCGCATCTGC AAGAACTTGC GCGTGGAGGG 
AAGGCCGATC AGCTTTTGTT CAATGCTCTT GGGGTAACGT ATGTGCGACT GGGAATGGCA 
GATCTCGCAc TTCGCTGGTA TGAAAAAACC CTTCTTCTGG ATGCAGAGGA CGAAGAAGCG 
TGCGTGGGAC TGATCGCCTG CTAcGAgGCG CTCTGCGACG AagCGcGCGC GTACACCCAG 
TATGGAGCGT ACCTGTCCCG CTGGAGGGAC AATCGGGTTA TCCGcAAGGA TTTTATAGCC 
TTTCTTGAGA GAACAGAACG GTGGTCcGAA GCGGCGGACC ACATCGAGTT GCTCGCCTCG 



fe/13041 
18780 
18840 
18900 
18960 
19020 
19080 
19140 
19200 
19260 
19320 
19380 
19440 
19500 
19560 
19620 
19680 
19740 
19800 
19860 
19920 
19980 
20040 
20100 
20160 
20220 
20280 
20340 
20400 
20460 
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GGTGAGCGAG GGGGTTTTTG GGGTACTCGC CTTGCGTTTG CGCGTAAAAA AGCCGGCCAG 
TACAGGCAGG CTGCAATTAT CTACCGGGCG CTCTTACGTC AGAGACCGGA CGAGCGGGTT 
TTACTGCACA ACTTGGTATA CTGTCTTGAC AAGATGGGGC AGGCAGACGC AGGGCTAAGG 
CTGTTCCGCG CTGCGTGCAA CGCGTTTGGG ACGAGCGTGG AA 
(2) INFORMATION FOR SEQ ID NO: 71: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1356 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 



'13041 

20520 
20580 
20640 
20682 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71: 

TTTATGCACC CCAnTGAATC GACAGCCCGA CTTCAGAnCA CACAnCCCGC GCAgCCACAg 60 

GATGAG CTc C TGCGGCAAaT GGTTACACAA ACACCGTCAC TTC AC CGC AG CATATATCCC 120 

TGCACCAATm rCGGCTACAC CGCACACAAC TGCAACCCCG ATAAGAACAT TATTCGTGAT 180 

CTCTAAACGC CTCAACCGCA TGTTCAAykm GTTCACACGC TGCCTCAATA TCTCCAATTC 240 

GCTTCTCAAT GTCTCTATCA ACTC TTTCGA TTCGCTCAAT GCGTGCTCCG ATTCCTcCAA 300 

GgCCTTGGCG GCCTTCTCCA ATTTCACGTC GAGCGTCTGT AAGGCCTTTT TGAGCGCGGC 360 

CGATTCGGCG TGCCGTTCCC TCAACTGCTG CTTGAGCATG TTTGATTCGA GCCGGATCGA 420 

TGCCACcTCC CCCATAATTT CCTTTAAAAG CCCACCAGTA GCCGCCTGCG AATCCGCATA 480 

TGCCACAAAA GAGCGCAACA ACACCATACC CCACAATAGT GCGCCCACAC CCCGCTTCCA 540 

CATTCGGTCT CCTCACGACG ATGCGTTCAC CTTCCATTCA TGAATAAATC CAAGCATGTA 600 

TTGCTGGATA TCCTCAAACC ACTTCTTTTG GAGTGCGCAC GCAGCaCCCC TACATAACGG 660 

AAATGCCACG GCTCCCATAC ATACCCCGTC ACCTGCTCGT AACCAGGGGG AAAAGACAGC 720 

GACCATCCAA AACGATGGGC GTTGCGCTGC GTCCACCTCC C TGC ATC ACT CCGTGCAAAC 780 

GCCGGCGTGA TAGAACCGAA ATCCACTACC GTCCCCAACT GGTGCTGACT TGTTCCTTCT 840 

CGCGCGGAAA AACGCATAGC CTCCTGCATG CCATGCTCCT GCGCATACCA GGAGAACAAC 900 

TTTTTCTGAT ACGCAAAAGA GCGATAGGCA GAACCAACGG ACAGTGCCAC CCCGTCACGC 960 

GCAGnCGCCT GAATCAGCTG ATGTAACGCT TCGTACGCAA TCTTAGTTAA AAGGAGCGAC 1020 

CTCCCCTTTG AAAAAAAGAG CCACTGCTCA CGCACCGGCA CCAGATGCTT CGGCACGAAC 1080 

GTTTCTGGCA GGGGATGTTT CTTGTCAACC AAACGCAACA GATACCCCTC CGTGGTAAGT 1140 
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ACCGCGTCGA GCTCTTGCAA AAACTCCCTC CCCTGTGCAC ACAAACGCGT ACAAGGCGCG 
CAGGAAGCGC CGCGCG tTCG CAGCAGCACG CACACGATGC AGATCCACCC GATCCACGCC 
CTGcGGcGAG ACCGCATTTC CCAGAGCAAC TAACACGTAC CCTGGCATAC GCACACGCAT 
TCCAACGCGC CAnTTAGTCC AACTCATCTA ATGATT 
(2) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4579 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



f/13041 

1200 
1260 
1320 
1356 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 

TAC TGGTGGT ATCCACCTCA ATGAACTGTT TTTCCCTTTG TGGGAATCTT CTGAGGATCT 60 

CCaTCAACGC GCAGGAGCTT GGCTGTATAC TTATGCACGT GATGCAGAAA AGGAGCTCTG 120 

ACATACAATC CAGCGGTGTT ATTCGTTTTT ATGATTTCTC CGAACTGGGT GATGAgCGCA 180 

AcCTGTCCTT CCTGGATGAG GTAAAACGGT TGCAGGwGCA CAACACCACC TAACAGCACC 240 

CCAACGACTA TACC TATGTT CAGAACAGGT CGTAaCGTGC GTGTACCTGT AGTCCACGTT 3 00 

TCCTCATACC CCTACTCCTC GCGTGTTCCT GCCACGACCT TCTTCGATAC CTTACTGATA 360 

TCCTTGAGCG TTAAAAGATT CTCCAGTTTT TTGTCAATCA ACAGCACATT TTCAGTCTTT 420 

TCCAGGATAG CCCCCAGTCC CTCAAGGTAC AAACGCGTTT TGGTAACATG AGGTGCTTTG 480 

ACATATTCAG CATAGATTGA GTCAAAACGT GCTACATCTC CTTTTGCTCT ATTTACGCGT 54 0 

TCATTCGCAT ATCCCATAGC CTCCTGAATC AACTTGTCCG CGTCACCTCG GGCCTTAGGA 600 

ATTTCCCTAT TGTAGGACTC TTTTCCCTCG TTAATGAGTC GATTCATATC CTGAATAGCA 660 

ATATTCACGT CTTCAAACGC TTGCTGTACC TCCTGAGGAG GAACAACATT TTGCAGCTGC 72 0 

ACGGAGGAAA CAAGAACACC TAGGCCAATC CTTTTCAGGA GAACATTCAT CATATCCTTC 780 

GCACGCATCT GAATCGCACT GCGCTCCGGC CCCATGATAT CAAGAATCGC TCGATCTCCA 840 

ATTAAACTGT TCACCACTGC TTTTGAAATG TCTCGAATGG TTTGCCTTCG CTCCTGGGAC 900 

TCAACATTAA ACACCCATGC TCTTGGATCT ACAATGCGAT ACTGAACCAC CCACTCGACG 960 

TCTACAATAT TCAAATCCCC CGTAAGCATA AGAGACTCGT G AC TG AT ATT ATTCACATAG 1020 

TGACTCTGCT CGGAACTCTT CGACGTTCTG AACCCGAACT CTTCCTTTTG CACCTTGGTT 1080 

ACCGGCACTT TATACACCCA CTCTACAAAG GGGATAAGAT AATGCAATCC CGGTTCTAGC 1140 
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GTCCGATGAT ACTTGCCAAA ACGGGTGACC ACCCCATTAT CAGTGGGAGA AATGATCCTA 
ATAGGGGAGG CAATTCCAAC AATCACGATA CCGAGCACCC CACCTATGCA TCCTGCCACC 
ACGCTCCACG TTGCTGGAGT CCACTTTGGT ATTCGCATCA CGGCACCTTC CTACACACGC 
TCGTCTTTTC GCCCATCTTA CGGGAAACAT TTTCCTGTGA CAACACTCAC CGTATTCACA 
CAGAcTTCGT TGTAGACAGA ATAAAAATTC TCACTCAGTA TAAAAACACA GGAGGCATGA 
TGTATCTTAC AAAGGAACTA CTCGATACGT TTGCGCACGA AGTCGCCGCA GATCCTATAC 
ACAAAGCGGT CGCAGGAGCT GTTGCGCGCG TCGGTCTTGA AGAAGCTGCA C TGAAC AC AG 
AAGTGGCGCG TCAgCACACA CATATTTTTT CTACCGAGAC AAAACGTGGA GAAATGACCA 
ATCAAAAAAT GAGTGGTCGC TGCTGGATAT TTGcTGCGCT CAACGCCGCG CGTGTAAACA 
CCATGAAAAA GTTGGACATT GAAACAGTTG AGTTTTCCCA AAACTATCTT TTCTTTTGGG 
ATAAATTGGA GAAAGCAAAT TTCTTTTTAG AAAATATCCT AGAAACACTT GATGAACCTC 
TCACCAGTCG GTTGATGGCA CACCTGCTTG CAAATCCCGT CCAAGATGGC GGGCAATGGG 
ATATGTTTTC AGGGTTATTA GAAAAATACG GTCTTGTGCC CAAAGAATGT ATGCCTGAAA 
CTTTTCACTC TTCCAACTCA CGCGTTCTTC TTGCAGTCCT CACTCGTCGG CTGAGGAAGC 
ATGCACAGCT TTTACGTTCT GCGCATGAAG AAGGCGTTGC GCTGCATACC CTGAGGGAGA 
AAAAGGAAGC GTTCCTTTCT TCCATCTACT CTATCCTCGT GAAGGCTCTC GGGAGACCTC 
CGGAGAAATT CG AC TTTGTG TACAAGGATA AGGAAAAAAA ATTTCACAAA GTCAGAGACC 
TTACGCCGCA GAAGTTTTTT TGCGATTTCG TCGGATGGGA TCTTAAAAAC AAAGTGAGTT 
TGATTCACGC GCCAACTGCG GATAAACCGT TTGGCAGAGC ATACACGGTT AAATTTCTAG 
GCACCGTAAA GGAAGCCCCG TGCATCTGCT ATGTCAATAC TCCCATTGAA GTGCTCAAAG 
AAGCTACAGC TTCTGCAATC CGAGCCGGGG AGCCGGTATG GTTTGGTTGT GATGTAGGTC 
AAATGATGAC GCGCAAAGAT GGTATCATGG ATACGGAGAT ATTCGGGTAC GAGTCGATGC 
TCGGCACTAC CCCTGAATTC AATAAAGCAG AACGGCTTGA CTATGGCGAA AGTCTTTTAA 
CACACGCGAT GGTCATAACC GGTTTTGACG AGGATGCACA AGGTAAC CCC GTACGCTGGC 
AGGTAGAAAA TTCGTGGGGA GATGACACAG GAAAAAAGGG CATGTTCTCT ATGAGCGATC 
GCTGGTTTGA CGAATATCTC TACCAAATTA CGATCGACAA GAAGTTCG T A CCACAGGTGT 
GGCTCGATGC GC T AG AG AAG CCAATAATAG CGCTCGAACC TTGGGATCCG ATGGGAGCGC 
TGGCGGACAC CCCTCTGTAT CTTAAAAATT AAGAAGAAGA ACAAGTGCGC AATTCTGATC 
GGTACTTATT TACGGTACGT CTTGCGCACT TGATGCCCTG CTCACCGAGC AACTGGGCTA 



13041 

1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
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TCcTTCGGTC 


GGAAAGGGAT 


ATACGCTGCG 


TTCGTACCTC 


TTGTATGAGA 


CGTGATATCC 


2940 


GGTACTTAAC 


TGATACTTTT 


GAGTGCGGAG 


AACTCGGGTA 


ATTCTGTCCA 


AGACTGGACC 


3000 


GATCACGATA 


TTCTTCGGTG 


GATAAAACCC 


GAGGGGAGAA 


AAAGTACCTT 


AAGGAAAAGT 


3060 


GTTGCGATCC 


GTACTGGAGC 


CATTTGTCGC 


GCACTATGCG 


GGACACTGTT 


GAAACGCTCA 


3120 


ATCCGGTCCT 


GTGTGCAACA 


TCTGTCATTC 


TCAGGGGCGT 


GAGctTCGCA 


GGTCCGTGAT 


3180 


CAAAGAAACC 


GCATTGGTAG 


TGAACTATTG TTTTCGCGAT ATCCAGCAAG 


GTACGTTCCC 


3240 


GGTATGAGAG 


CATACTTACA 


AGACTGAGCG 


CGTCGTGCAT 


GCATGCTTTC 


AACGCGTGGT 


3300 


TTTTTTCTGC 


CGCTTTTGAA 


TGCATGCAGT 


AATCGTTTCG 


GAAAACCACA 


GTTGGGATGC 


3360 


CCGTGCAGTT 


AATCTGTGTA 


ACAAACCCGT 


GCGCAGTTTT 


TGTAATCAAT 


ACATCTGGTT 


3420 


CAAGCAACAT 


GTTC GTGTC A 


GcCCGCTGAG 


CGTTCGACAC 


ACACTTACCT 


GGAAAGGGAT 


3480 


GTAGTTCCTT 


AATGAGGAGC 


AAAATATCTT 


TCACGTCATT 


TGACGAAACC 


TTCTGCACAC 


3540 


AAAGCCCCAT 


ACTATTAATC 


TGTGTCGTCA 


GCGCGTGCAC 


GGATACGCGT 


CCATCACACA 


3600 


TATTGTCAGA 


GCAGAAAAGC 


AATTCGCTGT 


GGTGTGTTAG 


TAGATTGATA 


ACACATCGAT 


3660 


ACAAGGGATC 


AG AG AAACG C 


TCAAAGCGCA gCCGCGCTTG 


GACTGCCAAT 


GATTCTTTAA 


3720 


AATTAAAAAC 


AGCACACCCT 


TGTGGCTCAA 


GTCTTTGAAT 


GAGCGCTATT 


GCC TGcGGTA 


3780 


TTTTTTCTTG 


AAGGGCTGTG 


GGCATACTAC 


CACACATGTT 


CTGAAAGATC 


GCAGGAGATA 


3840 


TGGAAAAAAA 


ACCGTGATCA 


TCTAACATCT 


GGATAAACGC 


GCACGCCAAA 


TCGAGCACAA 


3900 


TCGCTTCGTG 


TTTTTGATAA 


AAAACTTGTT 


CACGCAATAC 


AGCTCGGATA 


TTGTCAACCT 


3960 


GCTTATCGGG 


CTGATTTTCC 


AGCAATTGCT 


GAAAg CGATC 


ACGTGCGCGC 


ATGCgc tCAC 


4020 


GTCTATCACC 


GAGCGACAGG 


TAACAAGCCT 


TCCCAGTGCG 


ACGAGCGGAC 


GAAGGACGTA 


4080 


TTTCTAAAAG 


GGGATTGCGT 


TGCACGGCAC 


GGAGAACCTC 


GGTCTTCAAA 


TCCCCCCGAG 


4140 


AAAGCTGCAG 


TAAACAAAGC 


CCGTGCACCA 


ACCGCTGATT 


GAGAACTAAC 


CGCTGCTGTT 


4200 


GAACAAGCTG 


CTGCATATCA 


CCACGTGCCC 


TGCAAACCAA 


TCCCACGAAA 


GGAATACAGA 


4260 


AAAGGAGCGA 


AACGCTCACA 


GGTGCGTAGT 


TTCTAGGTTC 


TCCATTTCTA 


CTAGCGCACG 


4320 


CGACGCCGTT 


TGCACATACG 


CTCTCGCCTG 


AGAAGCCTGC 


TCACGGAAGC 


TAGCATTGTC 


4380 


GCTCCCACGG 


ATATCAGACC 


AATTGGTAAG 


CGTTGCGTAG 


GCACTTGTTT 


TCGACGCCCC 


4440 


GATCATACTC 


GCCAACATAG 


CGCCAATGCA 


ATGAAATGCC 


TTTAAGGTTG 


CATCAAGCAA 


4500 


GACAGTCGCC 


TCAGAGTCAA 


TTACCTTTAT 


CAGAGCTTCC 


AGAATACCCT 


TACTTTTCAT 


4560 


GGTAACTGTG 


TGCTCTTTA 










4579 
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(2) INFORMATION FOR SEQ ID NO: 73: 



PCf^K/13041 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1015 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73: 

TTCCCCAAAA CGAAGCGTCC AATCTTTTAA tAATGTGCAA GTTCaATATT CaAGGTACGT 60 

ATTGGGAACA GCGCAGAACT CTGTTTCATT TCATCCCAAA AGTATAACGA TGAGACAAAT 120 

GCATCCCACA GCTTAGTTTG TGGAGATTGC GATGACTGGG TGGGAAAGTA AGGCGTTAGT 180 

TTCTCATTTT TAATCACAGT GTTTATAGAA AGATCGAGAA ATTTATAGAT GGAAAAAGTT 240 

ATTAGCGGAG AAAAACTGAT ATGACTTTTA TCCAGATCTT TAAAATTAAT CTCTAACGAG 300 

GAAGACAATG TCCC CTGTAT TTTTATACGC CGCTTCCAAA AACGTACTGT CAACGGAAAA 360 

TCATCATGAG ACAACGTGAG CTTTAATTTA CTTATAGAAA GACCATTATT TCCAGAAGGT 420 

GCAGCCTTGC CAGATTCCCC CTTTAGAAGA TAAGAAAGAG AAAAATACTT CCAGCCGAGC 480 

GACACTTCAT AAGAATCGCT CATACTTTTT CCAATATCGT ATACGTAGGT TTGTTTACAG 540 

GTTATTTCGT AAGGCATTCG AAAATCTAAT TCAGCGCGTA CcTGCGGTTT ATCGAGAAAG 600 

CGTGCAGAGA GCGTCGCAGA GATATACGGA AAAGAAAGAA ACGTCGCAAT GCTATACGCA 660 

TATGGTCCGG GCGCAAAATG CACACTGACA CGTATCTGCT GTGTATATCC ATACAGTGAA 720 

AAAGCAGCAT TTGCGGTGAT GCTATGATCA CGTACATGTA ATGAACTTTT CCCAGTAGGT 780 

CGAGACGAGT CGTATAGCAC GGGGGTAATT GACCAACCGA GAAAACTTTC TCGAAAAAAG 840 

GAATCTGCAG AAAAAGGATA CACTGCAAGA TTATTCGAAC TCTGCACTAA AGCGGAGTTC 900 

TGAAATCCAT TTTGTTTTGC CAGGTGGTTG ACTATGAATC GGATATCGGT GCTGCAGAGT 960 

GCACTGTATC CCGTTTGACA TGCGAATTAT ATCATTTACA CACTGAAACT GAGAG 1015 
(2) INFORMATION FOR SEQ ID NO; 74: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9974 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 
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AAAACAGATT TGTAATGTAC CATCTGCCCA TGGATATGGT ATCTGCGGCG TCCGCGCAAG 
CCTACCCCCG CAGCCCCCTG ATTGAGCGCT CGTCCCCTAC AGTCTCACAC TTTTTGCCGA 
GAATATCTTA ACCGTGCTCT GTCAGCTCAA TACTTTGTCT ACAAGGAGAC GCGCCTGCCG 
TGAGAATCCA TCAACGCTCT GCTCCGTGCG TGCCTGTGCT TCTTTTTCTC TTCTTGCCGA 
GTGCGCCGCT TTGTGCGCGG GGTAGCAAGG ACTGGACGCC GCCGCAACTG GGCGAGGTGA 
TAGAGAGTAC CGAGCAGGAC CTTGCAGAGT TTGATGCCGG CCTTTTCCGT GCGGATCGCA 
TCCTGGATCG CCATGACCTC TACCGCAAAA CCATGCACCA GCTGTTCTCC ACGCTCCTTG 
AAGAACCTAA AAACCACGCT AAGCACCTGC AGCTCATCGA AACGTTAGAA AAGCTCGCCG 
GTCCAGAGAG CAAAGAAATA CACGAGTTTC TCAATCGAcT GCGCAATTCT TCTACGTACG 
CATGTACGCT GCCCGTTTCT TTCACCTCAT GGAGCGGGCG CGCATCCTCA TGGCTCGCCA 
GGAAT AC C TG AAGGCCGCGC TCCTGTACCG AAGCGGCTAC GAGCTCTACT ACGATGAGTA 
CCTTGCCGAC CCGTCAAGTC CGGGGAAAAA GGAgGTGCGT GCTCGCGTCG AGCAsgCAnA 
TGcGCATGTT TCCCGCGCAA AGCCCCTCCT AGAAGCGGTC GCCGCTGCAC GGGCTCAGTA 
TCAGAACACG CAGAAAAGGA CGTATGCTGC CAGCGCCCAT GAaGGCTGCG CGCGCGCGCG 
AcGCGTACTC TGCCGCCCCC GTGCGCCTGC TGcACCgGTG CCCGCGcACC GAGTGCAGCG 
TATCCTCACT CCTTAACGGT GGAGGCAGAA TTAAGGATTT TGCAGGACTT TTCTAAAACC 
ACTGAGGAAA GCGCCGCGCT CACTTCCCTG GTCCaAgCGC TTGGAGCGCT TTTAAAGTTT 
TCTCGCGACA TAGAGCACAC CGGTGTTGTT TTTGAACAGC TATCCACACG CGCGCAGAAA 
AATAACGAGA CACAAGAGGC CTTCTTGGCC GTTGCACGCA AAATTACGCT CGGGCGC AG T 
AAACTTGAGT TCGAAGGTAT TCTCGGCGCG CTCCAGGCTC CTGCCTTTGA CGCTTTTGTA 
GATCTTTTTG AAGCAGGTCG CGCACATGTA GCGGCGCTCC ACGACCAGGC GCGCGCACAG 
TTTACGTTTG CACATCCTCC GCACTCAGGC AGAAACATTC CCGCACCCAC CGACACTGcA 
CTGGCAAGTG CAGGCGCATG GGCAGCAGTC GGTGCAGGAC CTGCAGGATC GCTCATTCCT 
GGCGCTCCTC TCAGTGCGGG AGTCGGCTCT CGCGGCGCGT GGGGAGCGTT GCCTGCGCCA 
GTAGAGCCGC TGCTCCGCCA GGCGGATGAC GCATTGGGTG CGCTTGCAAG ACTGTGGGCA 
GCGTGCGCCC CGCTCGGTGC CCAGCATGGC AGATTTCCCC GCGATTATGA GACCTTTGGC 
GCGCAGATTG TAGCGCTCAG TGCGCACGCC GAg CGTTGCG CGCCACAAAA CACGCGTACG 
AC TTTT ACC A TGCACTGCTC GCCTTCCAGC GCGCCCCCAC CGTGCCTGTT TCGGCTGCAT 
TGCGGCGTCA GGACCTTTCC CAGAATGAAG CGTTCGCGCG GgATCTGAGC GAACTTGCAC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
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ACCACCAGGA GTTTTTGCGT CGTGCCCTTG CAGAAACCGA 
ATACGGCAAG CaCCCcGTCA CCGGGGGGTG CAGGGGATAC 
ATAAGGGAGG GGCAAAACAG AGCGCTGCCC CTGATACTGC 
AAGCGGGTGC GTCGGAGGAG GCTGACGCGT CGTCTTCCCC 
CGGCGCGTGC ACAGCTGCAT GCCATCCAAA GTGAGCTGTT 
AACGCAACCG CTATACCGCA CATATGGCGT TTCATCAGCA 
CTGAGTATGC GCAgnAGcTT aCCAGTGCCG AGGAAGCATT 
AGCGCAGGGT ACGCGCGTTG AGCTTTGTGT CTGAAACGGG 
ATATGGAGGC ACTTGATCGG TTGCTTTCTT TTTTTTCTGG 
AGCGTGGCTA TGCGTATGGG CTGCAGTCCC TGCGTGATTT 
TCTCTGCACG CGTGCAGACA CTTTTTTTGG CAGCAGAACA 
TGGCGCGTCA AGAAGCAGAG TACCGTTACC GACAGGCAGT 
ACTTTGGCGG TGCCCGTAAG AATCTGGTGC TATCTCGAGA 
CGTTGCGGTA CGACACCGGc tACGCTACCG AAACTGACAC 
CCTCAATTAA CAGACGGGAA AATGAACTGG TTGTAAAGGA 
AGGCAAAAGA TAAaTATTAC AaGGGAGAGG TGCTCGATGC 
CGAAAAATCG CTGGGCAGTT ACAAACGTCA CCGAGAATGG 
CTGTCATTAG TACGGCGGTT GCGCTCAAAA TCGGGCGGGT 
TTTACCCGCA GATGAGTCAG TTGTTACACC ATGCAGAGCA 
ATTTGAACGC GTCGCAGCGC CAAGAGATGG AACGGTTACT 
TACACAAAGT ACTGCTTGTC TATCCGTTGA ACGAGCGCGC 
TAGACCAACT GCTCGATCCC CGCTCCTTCC GGCAGCAGTT 
TCAGAGGAAC GTACAAAACC GAATCAAAAA AGGCCTACAG 
CAATCGATGC ACGCTTCTCT GGTATCGAAA AGCTGAAGCA 
GGGTTCGATT GCCGCCGCCA AACCCGCAGG C C ATTGC AC A 
CTGCGCGTCG TATCTTTGAG CGTAGAGACG CGGCGCTCTA 
TAGACGAGGC GCTTAAGCTG AATCCTGATA ACGATGCGGc 
TCCAGTCGCT CACCGGTGAC GGTGCGGTAA ACGTACTCAG 
ATCAGCGCGC cTTGCAGGAA CTCCAAAAAG GAAATAAGCT 



V^^l 



GTCTCTTTCC 
TCCAGTGCCC 
GCAAAAGGCA 
CTCCGAAATG 
GCGCcGCTTC 
CTCAGGCGTT 
GCGCTTTGAC 
TCCTCAGCAG 
CGAAGAAGAG 
GCGCACTCAG 
ACGGGCTATT 
GGAAGGTCTA 
AAAGGCCGAT 
GCGATTGAGC 
CGTGCGCGCG 
GGAGc GTTTG 
GGAAATTACA 
AATTCCTGAC 
GCTGTACTTG 
CGCCACCTCG 
AGGGCAGCTG 
TGCAAAAAAG 
TTTGCTCCTA 
GGAAGTGGAA 
ATCGTCGAAT 
TCAGGTAGCA 
TGCGCAgCTG 
TAGCGAAGAC 
CGTCGCCTCC 



PCT 

CCGCCTGCAG 
AGCCAGGCTG 
GTAGCCCAAA 
GCGGTGCGTG 
ACGcgCtTCA 
TCTGCGCTCG 
GCGAAAGACG 
GTGAGTAAGG 
TTCCTGTCTG 
TTTGAACAGT 
CACGAACGAC 
GGTCAAGATG 
TTGGCGCTCT 
ACGCTTGATT 
TATATCgCAC 
CTCATTCGTG 
AATTGGCTTT 
TTTGCACCTC 
CACGCGGCAT 
CGAGAGAATA 
AGTCTGAGAA 
CTCGATACCA 
GATTTGTACG 
ATCTACCTGG 
TTTACGCTGG 
ATTCAGCAGT 
AAAGATCGTA 
GAAAAAGAGT 
GCGGTGGTTG 



13041 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
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1/13041 


AGCAGCTGTT 


ACAGAAAGAT 


CGCAATAAGA 


AGTCGGCAAA 


GATTCAGCAG 


TTAAAAAAGA 


3540 


GGATTGACGC 


ACAATTATGA ATGCTCGTCT 


GTGCTTTTTT 


TCGCGTCTTA 


TCTTTTGCGT 


3600 


ACTTTCTATC 


tGTGcTTtGC 


CACTTGTTGC 


TCAGGAAGAT 


AAGCTCTACT 


GGGAAGATCC 


3660 


GTGGGCACTC 


AGCACTGAcG 


TGCCGCTTTC 


GTCAAAGTTG 


CGTATTCGCA 


CGATGTCGTT 


3720 


GCCGTCGTAT 


GGCAGGAAGT 


GACGCCAAAA 


AATGCTACCT 


CGGGAGAAAT 


ACGACTGTCT 


3780 


GCGTCTTTTT 


ACGATGGCAG 


TACGTGGCAT ACCGTGCGTA CATTTTCTCC 


ACCCCTTTTG 


3840 


TACAACCACC 


GTTCTCCTTC 


TCTTGCCTCC 


GTTGCTGTTA 


ACAGAAAAAA 


TGAGATTTTT 


3900 


GTTGCTGCCG 


CTTTTGATGC 


ACACACCATC 


ACCGTCTTTA 


AAACTACGGA 


TTTTGGAAAA 


3960 


TCATTTACGC 


ATACTGTATT 


GCGTTCTCAG 


GGAAGCGATA 


TTGTCGCCCC 


CTATGTGAGT 


4020 


GTTGCTTCAG 


ATGACTCGCT 


GCTGCTGTTT 


GCCTCTCACG 


GTTCTGAGGA 


TCACTTTTCT 


4080 


ATCTTGCTTT 


GCCGATCCGA 


AGATGGGGAG 


CGTTGGACTC 


CcTTTCAGGA 


GTTTTTGTCT 


4140 


ACCGAATTTA 


GCCGCAGACT 


CTTTTTGCCT 


TCGCATGTTT 


CAACGCAGGC 


CCAAGAAATA 


4200 


GTGGTGTTTC 


AGGCACATCA 


CCAAGAGGGT 


GAGAGAGCAA 


GCTATCAGTT 


GTATTCAACC 


4260 


GTTAGCTTTG 


ACCAGGGCAA TACGTGGTCT GCgCCTGTGC 


CTGTTACACA 


ACCTGATGAG 


4320 


TATCACAATC 


AGCGGCCCTT 


TTTGGATCGT 


CTCTCAGATG 


ATCGTTTTGC 


AGTTACGTGG 


4380 


GAGCGCTCTG 


AACGTACGTC 


GACGCGATAC 


GAGATGTGCT 


ATGCCGAGCT 


CGATCGCTAT 


4440 


GGGAGAAAAA 


TCGGGACTAC 


gCTCCGCCTG 


GCAGAACCTT 


CTGACCGTCT 


CATCACTCCC 


4500 


AACTTTGTGC 


ATATCGACGG 


TACCACATTC 


TGTGTGTGGG 


CAGGAGAGTC 


AGCCGGGCTC 


4560 


AATACCATTT 


TTCTCGCGCA 


GAAAAAGGAA 


GGCGCGTGGA 


GTACTACTGC 


CGTACGTTCT 


4620 


AGTGAGGATG 


CCTTGCTGTT 


TcCGCATGCG 


GTGCGCGTTG 


ACAATCACCT 


TGAGGTTTTT 


4680 


TGGCAAGAGG 


GAGAAGGGGC 


GCGTGCACGT 


GTGATGCGTT 


TGCGTCCAGA 


TCAGAGTGTA 


4740 


CAGCCACCGA 


CCCTGATTGC 


AGAAAATTTT 


TCGCCAAACG 


CGGTAAGAAA 


GGGGACGCGC 


4800 


GCGCGGtACG 


CATTGTATTT 


CCTCGGGATT 


CGTCAGGCAT 


TGCAGGGTAT 


AACTACGCGT 


4860 


GGCAATGCGG 


CGTGCAGCCT 


GCTGCTCCTC 


CTGATTACGT 


TGCACACTTT 


cCGGACAAAC 


4920 


CTCAGATAGA ACTGG AGGC A ACGCAGGATG 


GCACGTGGTT 


TTTGGCCGTA 


ACGGTGTGGG 


4980 


ACTTCGCCGG 


CAATAAGTCA 


GCTCCCGCGT 


ACCTTTCATA 


CACGCGGGGT 


ACTACGCCTG 


5040 


tGCGCGTCCA 


CAATTGCAAA 


CTCCTCTACT 


GGAGAACACG 


CATGCGCTGA 


AGAGCAACAC 


5100 


GTTTACACTC 


AGTTGGAATC 


AACCCAGTAC 


TGATGCGCAA 


GGAAACGAGG 


AGCGCGATCA 


5160 


CACCAGCTTC 


CTTTGGAGCT 


TACAACAGGT 


GGCACCGCTT 


TCAGC ACTAA 


CGTCCCTGCG 


5220 
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TGTGGATACT GATGTACGAA CGTTCGAAGA ATTTCAGCAG CGCTGCGTGC GCGCCTTTCC 
TATACCTGTG GAmGTGCACG GCAcGCGCAg CAGGcAGTCG TCCGTATCGT TCACTAATAA 
GGAGAACGGC ATC TATCGCT TTAG cGTAT A TGCCCTTGAT CGCTCTGGAA ACGTGAGCGA 
GCCCGCAGTT GTCTTTTTTG CCTTACGGCA TTTCGTACCC TACACCGCCA TTCGCTATGT 
GGATGTGAAA AAAGATCCTG CCGGTTCATT GCAGATGTCG ATTGTTGGTA ATGGGTTTCG 
TGCGCAAGGG ACAGTCAGTC AGGTATACAT CGATCGGGAT CGCAAAGCTC CATATGACTT 
GGTATTGCAT GCGCAGGAGT TCGCCGTTGG TTCAGACAAC CTTATTTCAG ACATACACAT 
CGATAATTTA AAAAAAGGTT CTTACCACGT GGGGGTATGG CACCCTGCTC GTGGGGTGCA 
TTTTGCAGAG TCAAGAGTGA CGGTTTCTGA AATGGGAACG GTAAAATTCG GCGCGTACGA 
CTATGAGCAT CAGGTGCGGT GGAGTATCCC ACACACTGGT GGATTGAGAG TGAATTTTGT 
TTCACTGTTC ATGCTGATAG CGCTTTTTCT TGCGGGTGTG GTGTTTGCAG CGTCACTTAC 
CAGGATAGGT GATATCGTCG GAGAAGCGTT TG T AC TT AAA AAGCAAGTGG AAGCGCTCAT 
GATAGGAGAG CTTATGCCGT CAGAGAAGAG ACGAAAGGCT ATGGCACTGA AAACACACGG 
TGCAGGATTG CGGGTGAAGT TCATCCTGTT TG C AC TT ACG CTGGTTATAT CTGTCATTTT 
TATTGTGTCC GTGCCGCTTG GAGTGCGGTT TTCAAAAACA CAAAAAGATT TGCTGGCTAA 
AAATCTTTTT TCTCGGGTTC AAGTGTTGCT TGAAAGTCTT G TGGCGGC AG GAAAGGTATA 
CCTTCCAGCG AAGAATAAGC TTGAGCTTGG CTTTTTGCCC AATCAAACAA CGGCATTGCA 
CGAAGCGCGT TACGCGtTAT CACAGGAGAA AGTGAAGAGC CTCACGAAGA AGGTATCGAT 
TTTGTGTGGG CAACGAATTT TAGCGATATT GAAACGGTGC TCAATGAGCC CGAATATCGG 
CAAGGCAATT CTCGTTTTGT TGACAAAAGG ATTGCGCAGA TTTTGCCGGC AATGGAGGAT 
TTGAACAGAC AGGTTAAGAA AGATGCAGAA AAGATAGCAA AGGGTATTGC GGATCTGACG 
CAGGAGGCAG TTGCGCTTGC GTTGCGCACT GATCAGGGGT CAGTACGTCG CCGAGATGAT 
ATTCAGTCCA TTACGCGGCA AATGGATCAA AGGCTTTTGG AAATTTTTTC TACATTTTCA 
AACAACGCGG TGGGCTCCTA CCCTGAATAT CGGGTTGATA ATTTATCAAA GCGTCACAGC 
TCCTACCTTT TCTATAAGCC CATCCTGTAC CGCCAACGCG GACACGcGgA TAGTTTTGTG 
CACGGCGTTG TGTTTGTAGA AGTCTCTACG CAGGAATTGC TCGAGCACAT TGAGGGTTTA 
CAGCGCGATC TCATTAAAAT GGTATTTTAC GTTTCTTTAA TCGCACTCGC CTGTGGGGTC 
TTTGGCGCGT GGATTCTTGC CTCTATTATC ATCAAGCCTA TACGCAGGCT GGCAAGTCAT 
GTGGCGATGA TTCGCGACAC GGAAAAAAAG GAAGAACTTG AAGGAAAACT GATTGCCATC 



5280 
5340 
5400 
5460 
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5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
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6420 
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AAAGGGCAGG ATGAAATCGC TCTCCTCGGA AGAACTATCA ACGATATGAC AGAAGGGTTG 
ATCAAGGCGG CGCTTGCCTC AAAGGATTTG ACGGTTGGAA AGGAAATTCA AAAGATGTTC 
ATCCCGC TTG ATACCAACAC TGAAGGGAGA AAGCTTACAT CTGGGTATAC GTGCGATGAT 
CACGTGGAGT TCTTTGGGTA TTACGAAGGC GCGCTCGGCG TTTCTGGGGA CTACTTTGAT 
TACATTAAGT TAGATGATCA GCATTATGCC ATCATAAAAT GCGACGTTGC AGGAAAGGGA 
GTTCcCGCAG CGCTTATCAT GGTTGAAGTG GCAACGCTCT TCCAGAACTT CTTTAAAGAT 
TGGAATATTC AAAGTCATGG TATCAACCTA AGCGACATTG TCTCTCGCAT TAATGATCTC 
ATTGAGGCGC GCGGGTTTAA AGGAAGATTT GCAGC CTTTA CCCTGTGTAT CTTTAATACA 
GTGTCCGGTA CGGTGCACTT TTGCAATGCa GGGGATAATA TAATTCATAT TTACGATGCG 
CAGmAAAGAA AAATGAAGCG TATTACGtTG CGCAAACTTC TGCTGCAGGG GTATTCCCGA 
GTTTTATGAT TGATATGAAA GGTGGGTTTG GTGTGGAAAC CCTCACCCTG CGTACAGGTG 
ATGTCCTGTT CCTCTATACT GATGGCATAG AAGAGGCGAA cGTCTTTTTA GAAACAAGCG 
GTTTGAACTG GTAcTGTGCC AGGAACAGGG ACTTGCGCAT GATGCGCCCC ATGAGACACA 
TACGGTAGGT CAGGCCGGAG AGGAGCTGGG AGCTGAGCGT GTCAGCAGCA TTATCGAATC 
AGTCTTTCTG AGGAAAGGTT TTTCCCTACA AAAGTGGCAT AACCCTGTCG AAGGCGAAAA 
GTTTGAATTT GATTTCTCCT CTTGTGAAGG AAATCTAGAC GAAGCGGTGC TCGCACTTGT 
GGCGGTGGAG CAGGTGTTCC GTATGTATAA GCACCCTCGG GCAACCAACC TTGATAAAAT 
CAGGGTGGAT AAAAAAGTGG ATATGTTTTT AGCACGGTAT TTTGTTCAGT ACCCTGAGTA 
CTGTGCGCGC AAAGAGGTAA ACAGCGAGTA CGAAGAGTAC CTGTATTATA CGTTCATTAA 
AGAAGACGAC CAATACGATG ATCTCACTAT CTTGGGAATA AGAAAGAGAT AGTGC CGCTG 
TTGTGCAGGT TATTGCATGG TGTGTGGGTT GTGACAAGGA GACGCAATGC AGATTATACC 
CATTGCGAGT GGAAAGGGTG GGGTTGGCAA GAGTTTGCTT GCGGCAAATT TGTCCATAGC 
GCTCGGTCAA GCGGGGAAGA AGGTAGTAGT AGCGGATTTA GATCTTGGCG CGTCGAATTT 
GCATCTGGCG CTTGGCCAAA AGGGAAATAA GCACGGAGTG GGAACATTCC TTATGGGTGC 
CTCTTCTTTT GAAGAGATTA TGGTGCCAAC TGGATATCCC AATGTATATC TTGTGCCAGG 
AGATTCTGAG ATACCTGGCT TTGCTGCATT GAAGGTTTCT CAGCGGCGGG CTCTAACAGT 
GGGTTTGTTA AAAACGCATG CTGATTATGT GGTGCTGGAT TTGGGGGCAG GCACTCATCT 
TGGAGTGCTT GAGTTTTTTC TCCTTTCTTC ACGAGGGATT ATCGTTACTG AGCCTGCAGT 
TTcTGCGGTT TTGAATGCCT ACCTTTTTCT AAAAAATGTG GTGTTCAAAA TGTTGTGCGC 
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TGCCTTTAAG AAAGGGACTG GGGGAAGTAT TTTTTTAGAG 
GGCGGTACAG CGCATGTATG TGCCTAAGAT TCTTGCTGAG 
GGGAGTTGCA GTACTTCTGG ATCGGATGCG GTCTTTTAGG 
GATTGCAGAT CCGAAGGATG TGGATAAGGC GTTAAAGATT 
TCTGAATATT ACGCTTGAGT ACCTTGGGGT CATATACCAG 
GCTCTCCTCT GGTCTTCCCA TTGTTGTGTA CAAACCGCAG 
GTACCGGATT GCCGATAAGA TTTTGCAGTC AGAGGGTGAG 
TTATGAAGGG TTGGTGGAAC GAAGTTTTGC CTCTGCAGAA 
CCAGTTTCGT ATGGACTATC TTGAGGATTT GATAAAAAGC 
TCTTGCTGAG ATC AT AAAAG CTCAGCAGTA TGAAATTGCT 
GCTCCTCCAA AGGAAAATAA ATAAGACATT GCGCAATGCG 
TATAACCCCT ATTTGTGGGG GTGTTTTTGG GAGAATACAG 
TGGTGACAGG ATAAACGGAA AACGG TGGCG GGGTAGTGCG 
TGGAGGTTGT GTGATGTTGA GTATTGTCTA TCCGTCGTGG 
TTCTTTTCCC TATTTTCGCT GGTACGGCTT CATGTATGTG 
CATACTGTTT CGCTACCAGG TGCGGCGCGG TGAGCTTGAT 
GCCTGTCACG CAGGATGACA TTATGAGTTT TTTTACGTGG 
AGGGGCGCGT GTTTTTTCCA CCATGGTGTA TGAGGTCGAT 
ATGGCTGATT TTTTGGCCGT TTTCTTTGCA AACGGGTGAG 
GTCGTACCAC GGTGGGTTAA TTGGCGCGCT CGTGGGGGGT 
TGGGAGAAGC TTTCTTGCAT GGGCCGATGT CGCTGCAGCG 
TTTnGnAGAA TTGG 

(2 ) INFORMATION FOR SEQ ID NO: 75: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5861 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



AATCTCAAGT 
CTTGAGCGTG 
CCGAGACTAG 
CGCCGCTCGT 
GATACGCAGC 
TCACTGATTG 
GAGGCGTCTT 
GCAGAAGCAG 
AAAACAGTGT 
ACTCTGAGGA 
TGAACTTCAT 
TTTAC gCGGA 
CGGTGCATTT 
ATTCGTCCGG 
GTTGCATTCA 
AAATGGAGTC 
ACGATTCTGG 
TTGCTGTATA 
TGGGTTGGAT 
GGcTTGTGGA 
TCAACTCCAC 



PCT 

CTGATGCTGC 
TGGATCAGCG 
TCATGAACAT 
GTGAGCAGTA 
AGAATGTCGC 
CCCAGGCAGT 
CCATTGAGGA 
AAGTGGATTT 
GTGTGGGAGA 
AGCAAAATCT 
GAGGGTGGGG 
GyGtGGTGAA 
CCTTACGCGG 
AAATAATTCC 
GTATCGCGTA 
GGGTGAGCGA 
GCATTTTAAT 
TGCGCAAgCC 
TGCGAGGAAT 
CTCAGTCGCA 
TTGGGTATAC 



13041 

8760 
8820 
8880 
8940 
9000 
9060 
9120 
9180 
9240 
9300 
9360 
9420 
9480 
9540 
9600 
9660 
9720 
9780 
9840 
9900 
9960 
9974 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: 
AGGAAGCACT GGAGCACGTC CGnAAGCACC GTCTCGCCCA TGCGCGTACC AC GGC AACAT 



60 
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AACTTTGAGA TTCAGTATCC CCGGTCTCCG TGCACTGTGC AGTAAAGTGA TGCGTATACG 120 

CTCCCTTTGG GAAATCCC AC ACATGCAGCT GCGCGAACAG CACACACTTC CAGCGTGCAT 180 

CAGAGTCGCG TTCAAGACGC CTTACCTGAG TCAAGCCAAA CACAAGTCTA CGGACATTCC 240 

CCCCAAACTT TTTACGCGCG CGCG r AwTTA CCGCGTGGTC CGGCACCTTA CCTACCCCGT 300 

ACAAACTATC AAGC C CGATG TTCACAAACC GACGCTTCAA AAGCCCCCCG aGCAAACGGG 360 

TGGCCGTCAG ATTCTCGTGC GTCAGGGrCC GTCCTGTCTT CCCATAATCT AGAATACATA 420 

TCGTTGTAGA TACCTGCCTG CGCACGGGAT GCTGTTTTGG ATCAGACGAA GGCGCCGCAA 480 

CGCGACACAC CAAATTAGCA AGCGCAGGCT CCCCCGCTGC GCTGTTTTCT TCCCCCTCGG 540 

GGGACGGTAT GGTAGACACG ACAGgTTCGG GAGATGArGG TCGTTCCTGC AACGCAGAAT 600 

CCTGTGCACA AAGGGAAGGA GCATGTACCA AAGCGGCAGT GTCAAACGAA AGCGTCACCT 660 

CCTGCGAAAC GCCGCTGAcG GAAAGGGGAA AAAGAGCACT CCCTG cGTAT CTGCGCACAG 720 

CGTCACCTGC ATCACATCGC CATGTTCAGG ACCAATTTGG TAGCATAGCG TaAAACGCCC 780 

ACGCTGTAGT GGCCACACGC TGTCGCGATA CTGCAACGTG GCACTAACCC CTACATGGGT 840 

CGGCGGTGCA ACGCGCGGAG cAAGGcGTAT ACCTTGcAGC AGCGCACGGA CATACGCGTG 900 

CAGcGCAcGG CTaCGCGCCG TGTCAGTCTC AGAAGAAGGG TCATATAACG CTTCAAGCTC 960 

ATTGAAAGCA GCGTATGTGT CCCCCGTACA GAGCGCATCC GTGATGCGCA TTACCCTCTT 1020 

CTGTGTCAAA AAACGCCTGC AATGCCGCGT GGTGCTCGGC AGGGATATGT CCAACAGTAC 1080 

CCTGcACCCA CGCAGACCCC CGCCCAGGAG AAAAAGACGC CGGGGATTCA GACACACTCG 1140 

CATCTGCCGC GCCCGCCTCA TTCCCCATTG CCGCAGCATG AGAAACATGC TCAGAAGCAT 12 00 

GGGCCACACC CGAAGAATTC AGGCCATCCG CATTCAGGGA AAAACTTGAG CGTGTCTGTG 1260 

ACGTACAGGC GCCAAAAAGC ACACTCCACA CCCACACCGT ACAAAAAAAA CGGTCCATGT 1320 

ATAATCTACA CCTCTTTATT CTGCAGCGCA CACCACAGCC GCGTGCTAAA GTACCGTCAC 1380 

GGGCCtTCGT TACAGCCACC CTACGATATC CACCAAAGAC ACATCACGTC TTCTTTTCGT 1440 

TAGGGACGTG CAGGACGACG CCGACACCAC CCATACATGA ACGCAAAGCA AAATGCCCGT 1500 

CGCATCGCTT CCGCACGGcT GCGCGCCTTC CGCGCCTCCC CTACCGCTCC TGCCTATCCA 1560 

GCACACTCCC TTTTCACTGC ACAGACAGCG TACCAGCGTG CACGCACTGT GTTCTTCTAT 1620 

GCGCCCCTGC CCCTAGAAAT AGACCCCTAC GCCCTTGCAT ACACTGCAGA AAACGCAGGA 1680 

AAGCACGTAG CTCTTCCTCG CGTATCGGGA AACGACTTGC ACTTTCACGC AgTCACGTAC 1740 

GCGTGCACTA CCCGCCCCTC TGTTTCCTGC TTCACCACCT TGTGCCCTAG GACCAGGGGA 1800 
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ATTAGAGAAC CCGATGCACA CAGTCCACGC CTCTACCCCC CGCACCCTTC GCCCAATACT 1860 

CCTGCACAAA GAACACTTGC CCTACCGCTT TTGATCGTAG TTCCCGCACT GGCATTCAGC 1920 

ACAAATGGCG CACGCCTCGG CCGCGGCGGA GGACACTACG ATCGCTTCCT CGCCCGGATC 1980 

GCCGCTACCA TACCAGCAGG GAGCTACTAC ACGCTCGGCC TCTGCTTTGA TTGCCAAATC 2040 

ATGGCTGTCA TTCCTCAAGA AGCACACGAC CAATCCGTAC ACGCGGTGCT CACCGAAACT 2100 

CGTCTCATTT CCTGTGCCAC GGCGCGTGCA CCAGCGCCAC CGTTCTCTTT ATAGTGCCTT 2160 

ATTCCTCCAT TCTAATCACA CACGTGCATG CACCAAGAGG ACAGCGCCGT GCTATCTTCC 2220 

CAGAAAGGAG GATGAAAACA CGTGAAAACC ATTCTCATAC TGGGTGCAGG AACCATGCAA 2280 

GCCCCTGCAC TTCGCGCAGn ACGGGAGCTT GGGCTGTGGG TGTGCGCGGT AGATGGGAAT 2340 

CCGCATGCAC CsTGCGCGGC ACTTGCAGAC GAGTTTACCC CAATCGATTT GGCCGATAGC 2400 

GCCGCGCTCG TncGCTnCAm gcGCcGCAAT TcGCGCGCrC sGCGGCTTGG ATGCTGTGTT 24 60 

CACCGCGGCA ACAGACTTTT CCGTTTCCGT CGCTGCCGTC GCCGAGGCCT GTGCACTCCC 2 520 

CGGCCACCGA TTGGAGGCAA CCAAAAACGC TACGGATAAA ACGCGCATGc gTGCCTGCTT 2580 

CACACGCGCC CGACTGCGCT GCCCCCGCTT CACGTTCCTT GAGCCTGACT CGTTCGCCTG 2 640 

GGACACACCG CCTGGGCATG CCCGACTGTG TTCCCACCTG CATAGCGCTG GACTCTCGTT 2700 

TCCTCTCGTC GTAAAACCGA CAGACAACAT GGGAGCCCGC GGCTGCACGC TCGCGCAATG 27 60 

CAAGGATACC CTCATAAATG CCtGCGCCGT GGCGCGCCAG TTCTC TCGC A GCGGCCGGGT 2820 

GATTATCGAG GAATTTATTG TCGGAAGAGA GTTTTCCCTG GAAGGgCTCA TATTCGACGG 2 880 

GACGTTGTAC GTCACCGCAC TTGCCGATCG CCACATCTGC TTTCCTCCCT CATTCGTAGA 2940 

AATGGGACAC ACGCTCCCGG CAGCGCTCTG TACACAAGAc GCACAAGCGC TCATCGACAC 3000 

CTTCCACAAC GGTGTGCGGG CACTCGGGCT CACCCATGGC GCCGTGAAAG GAGATCTCTT 3060 

CCTGAGTACC CCCTCCCCGA CGAAAACTCC ATCCACTGCC GCCACACCCA ACCCTTCTGC 3120 

CCCGTACACA CCCGAAGCAG TATTGGGAGA AATTGCCGCA CGcCTTTCAG GGGGCTTCAT 3180 

GTCTGGCTGG ACGGTGCCGT ACGCTCTGGG TTTCGACGTC ACACGCGCTG CATTGCACGT 3240 

GGCGCTTCAC GGTCCTTCAG CTGCCGCCTC GGCTGCCACC GCGTCTGTCG CCCCCCCTCC 3300 

TACTGCGCTc ACCtGctGCG CACACAGCTC ACCACTCTGT CTCCTCTTCC AGAAAAAAGC 3360 

CCATACGCCA GCGCAGAACG CGCGTGGATT TCCATTCCTG GGGTAATACA CCGAATCTGG 3420 

GGCCTTGCAG ACGCTCAACA GATCGCCTAC GTCAAAAACG TGTTCGTACG TATGCAGGAA 3480 

GGAGCCgcgG TGCGCTTTCC TCGTAATAAT GTGGAAAAAT GTGGCAACGT GCTGAGTCAG 3540 
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GCCCCCACCC GTGCACAGnT 
GCCTTGTTCC TGCACACCCT 
CAGCGGCCAG CCCAGCGCTC 
CCTTTGGGCA AGAGAGTATA 
CTGAGGTTGC CTGTGCACCG 
TGCGCGctGA cGCACGAGAC 
TTAAGGTAGA GCCTGCGCTC 
AGTTATGGCG CGCACTTATT 
TTCAACTGTC CTGATGTTCA 
cCTCAGTCAT ACGCAGCCAC 
TTTTTTGCTC AAAACCAGGC 
GGACCCCTGC GCGCTGCATA 
CTGCTACGAA AATACGCTCA 
CCGCAGTCCC TTGAGACGCT 
GGTTTTCTCC TTTAAAGAAA 
GCACCGCCTC CACGATAAGC 
CGGGCCAAAG GTATAGAGGC 
CGGTCTGTAC ATAGCCGAC A 
AAACTGTATC AAAAGGACCG 
CAACGTGCAT CCGCGGGTGC 
GCaCGCGCAA GAACTCAGCA 
CACCGCGTGG ATCGCAAAAA 
GGAAGCGCGC GCAGGAACGC 
GCTAGACCCG GAACACAGGC 
GCAAAAACTG TTCCCATTTC 
TCGCAGTAAT CGGGATACCC 
TAAAAAAGAA ATGAAAGTCA 
TGATATCCAC ACCCGTAGAA 
CCAATAGGAA CTGCGCGCGG 



PCTBB98/13041 

590 

ATCGCCGCAG CAGAAACCGC GTGTCGCTGC ATTGTACTCC 3600 

GCAACAGACG CCTTTCTAGC AAGAAAACGC AGCGCAGAAT 3 660 

CAGGACGCTG ATTCTGAGTA CGCAGCGTCT GCATCACACC 372 0 

CCGGACATCG TCTGCGATGC CTC AGGACGC TTCTTTACCT 3780 

CTCGTGCGCA CAGGACTCTT CCTTATCCCC GAGCCACTGG 3 840 

GTGCAGGGTC GCAGCATCC A TGCGCTGTGT ACCCTTGCAC 3 900 

GAAC CTGCGC TGTGCTTTGC GCGTTCCCAA AACCTCGCAG 3960 

CGCGGTGGCA TTCAAGGATT ACTATACGCG TTTGACTCCT 4020 

GTGCCAGAAA AAATAAAACG CGTGCGCAAA AGCTCTGCGG 4080 

GTCAGTCCCT GTGCACAGAG AGTCTGCACG GACGCGGCCG 4140 

ACCACGCTAC TACCGTCCTT CAACAAACTC ACGCGTGAGA 4200 

TCCTGCAGCG TACACACAAC ACAATGCGAG AGTGCTTCAC 42 60 

TGTGCGCGGA GCATTTGGCA GCGCTTAACA AAAAGAGAAT 432 0 

GGATACTCCG AGGACAGCAC ACTAAATTGT TCCACACAGG 4380 

AACTGAGGAT GTCTTGTACG ATGCGCGCGT TGCCAAAAGC 444 0 

GGGTGCACCG CCTGTCCCCA ACTGCCACGC ACACAATGCT 4500 

CCCTTTCCGG TATATGCACG GAATGCCAAG TACCCCGCTA 4 560 

CGCACAGGcA CACACGCCCC AGAACGCAAA CGTTCGAACG 4 620 

AGAGCATCTC CTGTCAGGGA GCGCCAAAAA CAGGGGTGCG 4 680 

CGATCGCAAc TTACGTACAA TGCATCCACA TGCGCAGCGT 4740 

ACGCGCACAC AGTCCTGATC CGCGCCGGGA ACGAACAACG 4 800 

TCATTTTGAA AATCAACCAA AAAAAAGGCT CTGCTCATAG 4860 

GCACGcgCGT GCGCgcAnTn ACCCCACCCC AGCAGCACAA 4920 

CCTATCTGCA CCGTCAGCGG ACACTCCAAA CAATACTTTT 4980 

AAATCAGTGG AGCCTGCGCC GTTCTTACAC TGCTCGAGGT 5040 

ACACCGCTTG CCACACCGAC AGACAGTCCC ACAACCCCCG 5100 

AGGTTCACAG GAACACCGAC TATTTTGTCC TTGGTAGACA 5160 

GGCAGAAAGA ACGCCCGCTC TCCCATACGG AGTACCCACC 5220 

AACAGGAGCG TAGAAAGCCC CGCATCTAGC TGcGTGACGA 5280 
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AAGTAAACCC GTTCTCCGCA GAAACCCCCA CCGAAAGCCC CAGCGTGGGG GTGAAGGCCA 5340 

GTATATCGGT GCGTTTCGTA nCTTGGTTCC GCCCTCAnCG TTCGCGCCcT TTCCCCACAC 5400 

AAaGAcTCCC ACCTGTCCAA cTCGGGGAGA AACAAAAACt GCGCGGCGTT CGCGTCAGTG 5460 

CAAgACCCCA CAACACCGCC AAAAgAGCGC ACCCCCCCCC gCCACCGAAC GGCGCAgCGG 5520 

CACATCACCT CACCCTCACA CCAACCACTC ATACACTACA CTCGGAACGT CGGCCCTATC 5580 

GTGAGCGAGA GCGGCAAGGT AAACTCCTTG AAATTAAAGT CACGAACACC AACGGCCGTG 5640 

CTCGCAGCAA CCGCAACTCC GGCAAAGGAA GTGAGATAGT ACTGCACCTC TAAGTTCAAC 5700 

GGTACGCTGT ACAGCAGCTT GC TATACC AC GCAGACGATT TCCCCTCAGA TGTCGCACAC 57 60 

GAATCACCGC AGATATTCAC CCCACTGGAA ACGATGGCCC GCAGCCCTCC CACACGCACC 5820 

GCGTAGCCAA TTAACGCCTG TGC AC GC AC A AAGACATTGG T 5861 
(2) INFORMATION FOR SEQ ID NO: 76: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3694 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76: 

CGAGTAGGAG ATACATACCG ACACTCAGGG TTTACACACG CAGTATATGT GCCGACTCTG 60 

CGATTTGATT TTTCAACCAA AAAGCATCGA CACTGAGGAC ACACTGCAAA GGTCGGTTTA 12 0 

AAGTGAGTGA CAAAATCGCA GACAGGGAAA CGCGTGCAGC CATAGAATTC CTTTCTTCCC 180 

CTTGTTTTTT TCCCCACGAT ATTCCCATCG CACGCAGGAC GCGGACACTT TGCAAGAGGG 240 

ACAGGCTGAG TATTTC TACA CTCAGGAAAC TTTCCGCATG CAAGGAAAAA TCCAAACCTG 300 

CCCAGTTTTT TCACCATCGT ATCACCACAC TGACTACACA CCACATCTGT TTTCTCATCG 360 

AACACACCGC GCATGCTGTT AAGATCTTTC ATCACCGTTG AAACCTTTTC GCTGAAAGCA 420 

GGATAGAAAT CCGCAATGAC ACAATTCCAC TTGATTTTAT CTTCCTCCAC CTCATCGAGT 480 

TTACTTTCCA TGCGCGCGGT AAAACTTACA TCAACAACAT CATGAAAATA GGTGGTGAGA 540 

AGATCACTAA TGACCTTTCC CAATGGGGTC GGCATTAGCT GTTTTTGAAT ACGAGTTACA 600 

TAATAGCGAT CCAGCAGTAC TGAAATAGTC GGTGCATACG TTGAAGGGCG CCCAATTCCC 660 

TTTTCCTCCA ACATTTTTAC GATACTTGCA TCCGTGTACC GAACAGGACC tGCGTAAAGT 720 

GCTGTACGGA CTGCACGTTA TGTAGTGCAA CTACCTCACC TTCCTTCGTA GGGGGAAGTA 7 80 
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592 




1 


i 

F8/13041 


CAGCTTTAGA 


GAGATCTTTG 


GGGGATAACA 


TTTTCAGTAC 


ACGGTAGAAT 


CCCTGTTCAA 


840 


TAACCTGCGT 


TTCAGTTGCA 


CTGAAAACCG 


CCGGGCCAGC 


GGTAATTTCA 


AACGTCAAAC 


900 


TGCGCACTCT 


TGCATCTGTC 


ATCTGACTTG 


CAACAAAACG 


CTCCCAAATC 


AACGTGTACA 


960 


GACGTATTTG 


ATCACGCGTA 


AGGTGCGCTT 


TAATCCGCTC 


AGGAGTGTGG 


GCAACATATG 


1020 


TTGGTCGAAT 


CGCCTCATGT 


GCGTCCTGAG 


ACTTTCCCTT 


TGCAGCGTAC 


CGATTGGGAG 


1080 


TACCCGGCAG 


TGCGTCAGAA 


AAATGCGTTG 


CTATCCACGC GCGCACTTCC TTTACAGCAG 


1140 


CTTCAGAAAC 


GCGCACCGAA 


TCTGTACGCA 


TATATGTAAT 


GAGCCCCACG 


CGGTGGGTAC 


1200 


CAAGAGATAC 


GCCTTCATAG 


AGCTGCTGCG 


CAACCTGCAT 


CGTTTTACGC 


GAGGTAAACC 


1260 


CGAGCCTATT 


GGCAGCGCAT 


TGCTGCAACG 


TAGAGGTAGT 


AAAGGGCTGC 


TTCGGTCGAA 


1320 


CATTTTTTTC 


AAAACTGCGT 


ATTTGAGAAA 


CTCGTGCCTC ' ACTCTG AGAA AAAAGACCGA 


1380 


TAGCGCTTGT 


AGCCTCCTGT 


TTGCTTTTGA 


ATACAGCCTT 


TTTCCCTTGA 


ATCAGTATCA 


1440 


GTAGTGCAGA 


AAATGACTTT 


TTATCCTTTT 


CAAACGTTCC 


TTCAACCGTC 


CAGTATTCTT 


1500 


C TGG AAC AAA 


GCGCTTTACT 


TCAACTTCTC 


GTTCACAGAT 


AAGACGAAGT 


GCAACCGACT 


1560 


GCACACGTCC 


TGCAGACAAC 


CCGTTTTTCA 


CCTTATGCCA 


CAGGAGCGGA 


CATAGGTGGT 


1620 


ATCCTACCAA 


ACGGTCCAGT 


AcGCGCCGCG 


CCTTTTGTGC 


ATTGACCTTT 


GCGGTATCTA 


1680 


TTGGAACCGG 


ATGGCCAATT 


GCCGCCCTAA 


TCGCGTGCGG 


TGTAATTTCA 


TTAAACACGA 


1740 


TCCTTTTGAT 


CGGCG TATC A 


CAATACGCCT 


GG AT AG AC TG 


TGCAAGGTGG 


TACGCAATCG 


1800 


CCTCCCCCTC 


TCGGTCACGA 


TCGCTGGCAA 


GAAACACTTG 


CAGTGACTGC 


TTAGATAGGG 


1860 


TGCGCAACTC 


TTTTAAACAC 


TGCGCACGAC 


CACGAACTGT AATGTACTCA GGCTGGAAAT 


1920 


CGTGCTCAAT 


ATCAATAGCT 


AAACGAGACT 


TTGGCAAGTC 


AATAACGTGG 


CCCATGGACG 


1980 


CTCGCACCAC 


GTAtGCGTTC 


CCAGATATTT 


TTCGATGGTC 


TGCGCCTTCG 


CAGGAGATTC 


2040 


CACAATAACC 


AAATGcTTCC 


GCGCAAATGT 


CTTCTGCCTT 


TTCGGTTGTA 


GCCCACGCAC 


2100 


TTCCATGTTT 


TCCGCCCCCT 


ATGCTTACCG 


AAACTGTCCT 


ACGTCCGACC 


CGTACATGCG 


2160 


TATTCCCAAT 


CGTCAAACAT 


ATCCTGcaCG 


CACGTCAGCG 


CGCGTGCACC 


TTCTGCATGC 


2220 


AGAAGTCTTC 


CTCCCTCATT 


TTGAAGACTT 


TCAAGCAAGG 


GTTCATACAC 


GTACACATCA 


2280 


CGTCCCTGTT 


CCAAGGCACA 


CAATGCGGTA 


ATCAACGCGC 


CTGACTTCTT 


TGGTGCTTCC 


2340 


ATAACAACGA 


GCGATCGTGC 


CAAACCTGAA ATGAGCCTAT TGCGTTCAGG AAAACGATAG 


2400 


CGCATCACAT 


GCTCAGATGG 


CGCATATTCA 


CTCAGAATGC 


ATCCTCCGGT 


TTCTATAATC 


2460 


CGTGCAGCAA 


GCGCGCTATT 


TGAGCGTGGA 


TATAACTGGT 


CTACACCACA 


GGCAAGTACT 


2520 
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GCGAGGGTGT ACCCACCACC TGCTAATGCT CCTTTGTGAC AG AATCCG TC 
GCAAGTCCTG AAACAATGGC AATGCCCGAT TCAGCACACG CCTTGGAAAA 
TTCCGAACTC CTTCACCGGT TGGAGTACGT GTGCCAACCA TCCCCACAAT 
GCACACGGCA AAGTGCCCCG ATAGAACAGC AC AAACGG T A CATCACTTAT 
CAGGAGGgAA ACGCATCATC GTCTTTGAAC ACCATTTTTA TCTGATAACA 
TGTATACCGT GCCGAACGAG GTGAGGCAAC GCAGAAAGTT GCGCACCCGC 
TGCCTTTCTA CAACACGCTC AAAATCACGC ACCTTCCATG CAGTAAGCTC 
CCTACAGCTT TTGAAACGCG CAACCGCTCC CCACCTTTTA AAAAG TG AC A 
GCAAGAGCAA TCTTGTGCGT TTCAGTGAGT ACAGTATCCG TGTTCATCCA 
GCGTAcACGC TGCGATACAT TCGTGACTAA TTCTTTATTC TGTCGCGCAC 
ATGGGAGTCA CTGCTTTGGA CAAGGATGCG ATCGTAACAC TCTATTGCCT 
TCCAAGTGTG TAG T ATGCG C GCGCAAGAAC GAAC AAAGC T GCTTCTGTAT 
TTCGAGGAGC TTTTGCAAAT GAGGAACCGC GTTCTGTGGC TGATTCAGTT 
CAATACAGAA AG AC CGT AC A ACGCGTGGGA ATACTGCGCG TCCAACGAGA 
ATACGCAGAG ACTgcTAACG CGCGATAAGC AGCTTGCTGC TGCATTTTCT 
AaTAGGAGCG ACGTACTTAG CCGCATATGC GGCACACAAC GCCTGGTAAA 
CTTATTTTCG GGAGCAAAGG TAATGGCCTG GGTAAACGCA TCAAGCGCGT 
CTTACGATCG AAGTAGCGCA ACGCGAGCAT CTTGTACCAA ACACCCACCT 
GCGTGCAAAC GCTCGAGGCG CTGTTCGTGC AGCTTTACTG CCCTCCGCAG 
GAGGTTGGAT GGGGCACTCC TTTCTCTAAA TCCT 
(2) INFORMATION FOR SEQ ID NO: 77: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6422 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



PCT/ 
TATTCCACGT 
GGCCAAACTG 
AGGTTGGGTT 
TTCTCTTAAC 
CCGCATGGTT 
AGTGcGTATG 
CTGArAAGAA 
GTAAGAGAGC 
CGGTCCCCAC 
GATCTCTCTG 
GAGCATATCG 
TTTCTTTCAA 
CAAATACATA 
GCGCACGCAC 
TCTCAGGATC 
AAAAAAGATG 
GCGTATACAT 
GATnCnCnGT 
TTCTTCAATA 



13041 

2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3694 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77: 

TTACCTaAAC CGCGAATTTC CATATGGTGA CCGaTACTTT TGTGCACCCC CCGCCCAATG 60 

AGGCTATTTC CATTGACGCA AGAAATTTCT ACTAAGTCAT CTGCAACAAG TCGATGACCG 120 

CGCTCAATTA ACTCCAGAGC AGTCTCACTT TTTCCTACTC CTGAATCTCC TGAAATAAGA 180 
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PCT/ 



594 



ATCCCGACGC CATACACCTC CACCAATACT CCATGAAGCG CTATCGTCGG TGCGAAGATA 
TTGGAGAGAA CACGCATGAG ACGTAAAGAA AGCTCGCTCG ACGTAAGACG AGTGACCAAG 
ATAGGGCAAG AAGAAGGCTC AGCAAGATGC AAAAACTTCT CCGGCGGGGT AATTCCATGG 
GAAAAGATAC AACAAGGCAA GTCAAAGGTG AACATCTTTT CGATAGCACC GTATCGTCCC 
TGCTCTAAAA GGGCGAGCAG ATACGCATGT TCTCCGCGGC CAAAAAGCTG GATCCGCCGG 
TAGGnAAACA AGTCAAAAAA GCCTGACAGG ACAAGACCTG GTCGGTTCAG ATCCGAGATA 
GTGATGGGAT TTGCCAGTCC ATGGTGACCT GCGATACAAC GCAGAtCAAG CGAATCGCGC 
TCTTTCAGAT CGAGCTTGAG CACATCGAGA ACGGTAAAAA GAGGAGCACC CACGGCCGCT 
ACTGTAGCAC AAAaGCCAGG ACCCGTAAAC GAGCCCACCG CAGTGAGAGC CTCTCTTCAA 
GAGAGCCACA AGTCCGTACA GGAACTCTTG ACTTTTACAT ACAACTGGTG TCTGCTGACG 
GCGCCCGGGT GTGGGGCACA CGATCCATTA GCTCAGCGGA GAGAGCGGCC GCCTCCTAAG 
CGGCAGGTCG GACGTTCAAG TCGTCCATGG ATCAGGAACG GCGATGGTCG GGCGAGGGGG 
ATTTGAACCC CCGACCTCTC AGTCCCGAAC CGAGCGCGCT AcCACTGCGC TACCACCCGT 
GCACGCAAAG AAACCACACA GAAAGGGACG CGCACCGCAC AGTGCGCAGA CGGGAGCGAC 
GGGGCTCGAA CCCGCGATCT CCGGCGTGAC AGGctGGCGC GATAACCAAC TTCGctACGC 
CCCCAGAACT TGCGCGCATC CTACATCACC CGCACAAAGT TATCAAGCGG CGATGATAGA 
TCACCCAAGG AAATAGCGGC AATAGGGATT GAACCTATGA CAGCGCGGAT ATGAGCCGCG 
TGCTCTACCA ACTGAGCTAT GCCGCCAAAA AACCCCCGAC CGCACACCAC CGCCATCCTA 
TCCCTTTTTT TTACACTCTG ACAAGTCCTG CCTTCCCCCT GCCTCCCCGT GCTTGACGCA 
AGAAACAATA AAATTGCTGC CTATGATTTC TTTAGCCGCG CGTATAATGC GCCGAGACAC 
CGGGCGTGGT CTTTCGTCGC ACAGTGGCAC TCGCGCTTCT TCTGCGTACG CTCCCCTGCG 
CCGCGCACTT CGGAAACCGT GACCGCACGT TCTACGACCT TAACAACGCG CCCCTTGCTC 
TGCGCGCCAT CCAGGACGCA TATCCTCATC TCAACGCGGT CATTGCCTAT GACCCGCGGG 
AACAGGACTG GCTCATCCGT TCAGACGGGC GCACCCTCTA CTGGGCAGAG GGGCGTCTTT 
TACCTCGAGA ACACCGTGAT CAAGCCCACG ACTGGCGCCC CATCATCGAT TATGTCTACG 
CGCGAGAAGT CCTAGACCCC GCGCACCTTT TTCCAGAAGA AATACACGCG CTTAGGCCTA 
AGACGCTTGC AATTAAACGC AGCGCTACAA AACCCTATCA CGACGCTTTT TTCACGTGGC 
TCTACGGTCC TGCCACACGT TCGGAAATCA ACGCTCGTCT CGCGCGCGAC TATACGTTCT 
TAGGAAAGCC CGTATACGTA CACAAAGCAC TCATCACACG CTTAAACGCA GTACAGGAAA 



240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
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AAATCCTCAC TGCCGCGAAG ACGCATGCTC ACGTACAAAA GTTTATCGAG GATCTTTTAC 1980 

GCGTCGACGG CTTTAACTGG CGTGAGATTT CTGATTCTAG ACAAAAGAGT AACCACAGCT 2040 

GGGGGATCGC GTTGGATCTT ATGCCCAAGA ATTGGCAACG CCACACCATG TACTGGAATT 2100 

GGGAArctGC GCATAACGAA GATTGGATGC ACATCCCCAT AAAAAAGCGC TGGGCTCCAC 2160 

CTG CAGAAAT CATCAGTCTT TTCG AAAGC G AAGGGTTTAT CTGGGGCGGA CACTGGATGC 2220 

TGTGGGACAC TATGCACTTC GAATACCGGC CGGAATTACT CGCTGTACGT AAAATCCTTG 2280 

CCGAGGGGAA CCGCTATGAC TTTCAAGAAC AAAAT AT AGT GGTGCATGCA GATGATTTTC 2340 

CTGCGCAATA CTTTTCTCCC AAAGAAGTAT TCGGCACAGA TGAGAAGGAG CACATTACCT 2400 

ATGCAGAATC CTGCGTtCGT GCAAcGCAnG CAmaGTGTTA AAGAACTCGT TCGTGCACGC 24 60 

ACGCTGGTAG CGCGGTTTTC TCCTATGCGT CGGCTGCACG TGTATGCACC TCCTG AAAGC 2520 

ATTCACAACA GCATAGATAC AGCCCTATTA CGCATGACCG CACAACTGAA GAAAAATTAC 2580 

ACAAATGCGA AAaTACGGAA CAATTCTCGT TTGCTTTCAA AAAgCATGCT CAGACACGCG 2 640 

CGTCTCGCAG AAGCGCAGAT GTGTACACGG TATCGTGCCG TCATGCTCAG GCAATCGCAn 2700 

TGCAGTATCC ACACGCCCTG TCTTGGCAAA GTAAAGAACG AAGCGATGCG CTGTGGATTG 2760 

CGCTTTTTTC CgTACGGCAA GAAGCGGCAC GACGCTCCGT GTGCACACCC TCGTCTAAGG 2820 

AACAATGCAT GACGCACGCA CTTTCTTCAT GCGTGGATCT TGCACGTACG CACATCCTGT 2 880 

TGCCATAGGG CGCTTCTTCC CCCTCTCTTC CCCTACTCAC ACACCACAGG GTACACTTAT 2940 

GAAAAGTCAT GGCACCATGT GCTCAAGGAA TGCGCTTCTT TTGCCGAGAA GGGGCGCAGG 3 000 

GCTGCATGTT CTTACCCCAC GTATACgCGA GGCGCGACCG GTGAACACAG GCGTTAAGGT 3 060 

TATTCTCAGT CTATTCGCGA CGCTCGTCCT TATGGTGGGG GTGTTTTTCT GCGCACCACG 3120 

CGCTTCTTTT GCCGAGTTTG AAAGACACTT TTACCAACCG ACTGTTCTCA GTGCGCTCTC 3180 

TACCAACTTG CGTGAGGTCA GTAAGGCAAG TGAGGCTTGG CACAGTCGAT ATCGACCCCT 3240 

GTTTTCTCAG TTCTGTGCGC TTGATGCAGT CAGAAGTAGT TTCGATCCTG CGCAAAAGGC 3300 

TGAAGACATT ACACAACGTG CCCGGGAGGC CAGTGCGCTC TTGTCTTCTG TCGCTGGTCT 3360 

CAAAGGGGTG CGTATTGTTG AGGCGCAGAA ACCAAATATC CATTTTTCCA CCTTTGAGTC 3 420 

CGACGTTCTC CTTGCTGACA GTGGTTCTGT AACCTACAGA AAGTACAACG CTGAGGAGCA 3480 

CGACGTCCCT CTTCAGTTTC TAGGGGAGCA TTCCCCTGAA CCGAAGtTAT TATCGACGAG 3540 

TACCATGATG CGCTGCTGTA CTCTTTCCCC TCCCTGGGGA ACTACGGGGA ATATCGTGGA 3 600 

CGCATTCTTT TCTACTTGTC CTTGCGTGCC TTGGGCACCC ACCTTATTGC GGAAAACAAA 3 660 
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CTGAAGATCA CAGACAGCAT TGTTCCGCTT TCCGCTGATG ACtwaCCTTC GGTGGCATCG 3720 

TTATTGGTAT CCCCCATGAG GGGGTACGTT CCCTCAAACC CTCTGTGCTC GCAGAGTGGA 3780 

AGCGCAAGCA GTTCAGGGTA CAGACAGTCA GGAGTGAGCA GCACGAAGAC TGGGCACTGC 3 84 0 

TCAGTAATGC ATCAGGCGCC TTTGTCATTG CACAGGCAGT GCCCGTCTTG CTGTTTGGCT 3900 

TTACCCCTCT GACGAAGGGC CTTGTCGCTA TGGTTGCTGT TGTGACTACT TTTTTGCTCG 3960 

TATTCCAGTT GCTCAGCCTT CGCCAGGACC CCCTCACAAA ACTGAGGGAC AGGCTGATAC 4020 

ACTTCCaCGC GCAGCTCCTA CACAGTTGTC TCGAACAGAA GGAATCACTC GAGTGGGAGG 4080 

AGGTGCGAAC CCGACTTGAA CACCGCAGGC GGGAAACAGA TGCAGAAATG AAGAAGTCTC 4140 

TTCCCAGGCG TCTCCGTATA AGGCGGGGAC GCGAGCTCGA TGCGCTCCTC AGTAAGGGTT 4200 

GGGATGACGT CTTCTCCACC TTGGAGCATG GTTACGGTGG TGCGCGTGCT ATGAACCGCG 4260 

CGCAAATCGA ACAGCTTGTC AGGGAAGTgc TCGCGCAGAG CCTTGCAAGT GGGGAGGCTG 4320 

TGCTACCTGT GGCGATGCGT GCGGACACAG CCGATGAAGA GCTCG AC GAG GTGCTAGAGG 4380 

AACTCCCTGA CGAGGCAGCC TCTTTGCCTT CCGATTC C AG TCCGGAAGAG GACCTGGACC 4440 

CCTTGGAGGA AGTCGAGAGT ATCGAGGGGA CTGCTGAAGA AAGCACACGC GAGTACGCGG 4 500 

CTGCGGGAGA CGCGCTCCTC TCGAAAACAC CCCAGCTTTC AACGCACAGC GAG T ACGTGC 4 560 

CGGCGACACT CGCAGAACTC CTGGGCCGCA ACGCAGAGCC CGGCGACGTC GTGCGGGACT 4 62 0 

CAGCAGTCCT CGAATATATC GAAGGCtCTT CGAC TATCGT CCCTGCTGTT TTTATGAGAG 4 680 

CCACGCTGTC CACGACTGCC TAGAAGTAGT CACGGGAGAA GACGGCCCCT CTCTCAGCCC 4740 

TATGGAAAGC ATCGTCAGCA CCGAGGACGG TCTTTTCACC ATTCGGGTGA GTAAGGAGGA 4 800 

AGGAAACCAC CTCAACCGCG ATTTCAAGGC CCTGGTGGAT TCCGTACTGT ACTGAAGAAC 4860 

ATATCTTTCC GCCGGTGGAG CCGGTCCTCT TACTCGGAAG CAGGCACGAA CGTGTGCGCC 4920 

ACCACGTTGG TTTTTATGAG CTCATCGACT TCCGTCTGGG AAGACCTGAG CCAGTACTGG 4980 

CTTCTGCCaA TCCCCCAGCT GATTTCCCCG CGCATCTGCA GCGTATCAAm CTTGTAGCTC 5040 

CTCCCATCCG CcAGGGATGA AATTAATTTT GCAGTAATAG TACTTACCGC TTGCAGGATC 5100 

TATTACGTAT CCACCACCCC AGGAGCCTGG AGAAGTACGC TCGAGGTTAT AGATGAAgGG 5160 

CGTACCAACC AAGGGCATAT TGGCGACGTT TCCCTTCTTG GAGAAATCAG GATACGTTCT 5220 

TGTACACGAA ACCACCGCCG CATTCGGAGC CCTCCCCATG CACACGAGGA TCTTGCCAAA 5280 

GAGCTTACCA TCCTGAACAT ACAACCGCCA CACCCCAGTG GGTTTCCCTG TATTGTCATC 5340 

AACGCTCTTC CAGATACCTT CCACCGGATC GGCCATTTCC TTCTGCACAG ACGGCACCTG 5400 
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CGCTGCCTTG TCCGAGCTTG CTGTGAAACA CGGCACACAC AGACACAGTA 
TACGATAACC TTTC TCATAG CCCGCCCCCA CAAAATATAA TGCGCACAAC 
AAATACGACC TGCAAAAAAG CAGGTACCAT ACGACGCACC CCCCACATAC 
CCGTGATTGT GCAAAACGCT CTTGAAACTC CAACACTACA TCCGATATCG 
ATTAAGAGAA ACTAATTTCC CCCGCTCACT GTAAAAGTGG ACAATGGcTc 
CGATAGGCGG TAAGCCTCTG AAGAATTGCC GACATCTTGT CATCCTCCCG 
ACTCCTCTAC ACCGATCGCA CACACCCTCT CTCTTAnGsT GCGC AAAGAG 
CTGCTCCCAC AGGCCGmACA CACCtGCGGC CAGTAAGACG CGCAACAAGG 
GtACTACAAT ACTCACCGCG TAGTCTATCG GCACAATGTC CTCTAAGCAC 
TGACAGTGCG AGGAAACCCA TCTAGAATAA AACCGCTAAC CACATCTTCG 
GCTCCCGCAC TAGCTCCGTA ACGGTCTGGT CATCTACCAA GCCGCCCACT 
TTTGAACTTT TTTACCTAAT GCCGTCTGTT TCTGAATTGC TGCCCGAAGA 
TGGAGATGTG CACAACGCCA CAACGCCCAG AAATTTCACC TGCAAGCGTA 
CACCAGGAGG ACCAAGAAAA ACAAACCTCA TGAAACAAAC TCCACCTTAT 
GGAAGAAAAA ACACACCTCA CCCCGTTCTC TCGCGCACAC AACCGACCCG 
ACGCCGCGCG CGCCGCGAAA CCCCGACCGC CCAAACAAGA GCGCACCGGT 
TACACGGGGA GGATTCATTG GCAACACAAA ACTGCGACAT GCTCGATAnA 
CA 

(2) INFORMATION FOR SEQ ID NO: 78: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 464 6 base pairs 

(B) TYPE: nucleic acid 

{ C ) STRANDEDNESS : double 
(DJ TOPOLOGY: linear 



CGACACCATA 
CAACAAGGGG 
ATCGAGCTCA 
GCGGAGCACC 
CGCCTGCGCT 
CACGACCAGT 
CACATGATAA 
ACATCGTCCG 
CTAGCCTGCG 
TGACTGACAC 
TCAACTACTT 
ATACCCCCTG 
CCCTTACCGG 
CTTCCTACGG 
AAGgCGTtAC 
ACTCCAATTC 
nCCTTATGTA 



1/13041 
5460 
5B20 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6422 



. (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78: 

CTTCAAAACC GCCAAACAGT ATAGAAACAA ACAGGTTTAT CATGTAAGGA AGAATTACGC 60 

TCAGTAAAAC AAACGTAACT ACGATTGTAA GCGGAAGCCC TTCAGAGAGT AGATTCATTT 120 

GTGGGGCCGC TTTTGTTAAT AAACCCATTG AAACGTGGAT TAACAGCAAT GCTCCCATGA 180 

TAGGCAGTGC GATAGTCATC GCGTGTAAAA AAAGAGCACT CAACGCTTTG GTAAAAAACA 240 

GCAGGAGCGC TTCCTGTTTC CGCAGAAAAA CAAAGCAATT AACAGCCTGA AAGCTCCGCA 300 
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GCACGCCTCC TAAAAACAGG ATTTGAAATC CTTTTATTTG CAAAAAAACA AGCATCGCCA 3 60 

CAAAGTTCAA AAACTGTCCC ATCAAAGGAT TTTCTATTTG TGCAAAGGTA TCGTACATCT 420 

CAGATGTTCC AAAACCCATC TGATACGAAA AAAACTGTCC TGCCGCACTA AAAGTCGTAA 480 

AAATTACGCT AATAAAAAAA CCTGTTAAAA TCCCCAGCAA ACCTTCTCCG AGCAACAAAA 540 

GCACATAGTA CGCACTAAAC TCACGAACCT GcATGGGTGC AGGGTACGCA AGCGGTAATA 600 

CGAGGAATGC AATCAGGcCT GcGAGTGCCA CCCTCACTaC CCGAg aAACC GAgCGCACCG 660 

AcAAGaGaGG TACCGTAAAC ATaAGcGcAA ACACGcGGAc CGCCGwCAAG aAAAAAAAGA 720 

GAAGCCTGAG aAAAgAGTGC ATCAAAGGAC CGTTCCATCG CACATATCCA CTAGACAGGT 780 

CCACTCCTCA CTAACTGAGG GATAATGTCA AACAGCcTTA CGGTATAATT CTGCAGCATT 840 

GTCAGCATCC ACCCACCGAG GAGGGCAATC ATTCCCAATA TGGTCAACAT CTTAGGAACA 900 

AAGGTAAGTG TTTGTTCCTG AATAGACGTC ACTGCCTGAA AG AT AGC C AC TATTAAGCCA 960 

ACGACAAGCG CTGTGCACAG AACAGGCGCG ACAAGTAACA CCACCTGAAA AACACCCTCT 102 0 

CGTATCAAGC CTAATACCGC ACCTTGCGTC ATCACACACT CCCGTCCTGT ATGTTATAAA 1080 

AACGAATGAA AAAGCCTATC TATCAGCAGA TTCCAACCGT CCACCAGCAC GAACAAAACC 1140 

AATTTAAACG GCAATGAAAT CTGAACCGGC GG C AGC AT AA TCATACCCAT AGACATCAAA 1200 

ATACTCGCTA CAACCATATC AACAATTATG AAAGGTAAGT ACAGGAAGAT ACCAATCTGA 1260 

AAGGCTACGG TCAGCTCATG CAGGATAAAA GCAGGGATAA GGACATACGT GGGCACGTCC 1320 

GCAAGTGTAT CTGGCTTAGG CAGCTTTGCC ATGGACATAC AAAGACGCAC AGAAGACGGG 13 80 

TCATGCGCCA TCTGACGATA CATGAAGACA CGCAGCGGTC TTTCTGCCTC CGTATATGCA 1440 

GTCTGGATAT CTACCTGGCC ATCGGTAAGA GGTTTAAACG ATTTGGCATA AATCTCAGTA 1500 

AAGACCGGCC ACATGATAAA CAGGGCGAGA AACAATGCTA - TGCCGTGTAA AACCTGTGTG 1560 

GGCGGCACTT GCTGCAGCGA CAATGCACGT TTGATAAAAT CAAGGACGAT AGACAAGCGC 1620 

AGAAAGGCAG TCATCAAAAG CAAGATACTC GGCGCAAGGG AAATGAGCGT GAGCAACAGG 1680 

AGAAGTTGCA CAGAAAAAGC CACTTCCCGA TTGGTCTGGG GCTCCCGGAT ATCAAAATTG 174 0 

ATGAAAGGAA TGCGTGAAGC CGGCCGCTCA GCATTGATAC CAGTAACGCC GCGCTCGACA 1800 

CCTCCGCGCG CATCCTGTGC AAAAAGCGGG AAGAAAAACA ATGACACGAA AAAGAGCGCG 1860 

CGGCGTACGC ACGCACGAGC ACGGATCACA AAGCATCCTG TGCAGCAGAT TCTTCAGAAT 1920 

CATTACGAGG GATGCGACGC AACTTCTTTC TCGTATCTGC AAGAAAATCT GCCTCAACTA 1980 

ACGGCTTCCC CTTACCGGTA ACGCGCGCGG GCAAGAGGCG CGCGAGCATC TGAGAAAAAT 2040 
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CCGCACGGGC GTCAGTGCCC TGCTCATCAG CGACGATGTT CATGGTATCG ATGAGCTCTT 2100 

TGTCCTTGAC CTCTGCAATG AGTGAAATGC ACGTATCAGA CGC TGCCAAT ACAAAGGCGC 2160 

GCTCTGCAAG TCCTACCACG TACACTGCGC GCCCTGGCGc AATGGGCAAA CAGGCAAGCC 2220 

GCTTCAAAAA TGGATCGTGC GCGCTAGAAA GAAACGcAtG CGTCTGATAA GACGCAGAAA 2280 

CCCGTATACA GCCGCACACA CCACGCAGAG CACGAGACTA AAACGCAACA GAAGAGAAAA 2340 

CACCGAAGGG GACGGGTCAC GCGCAACCGG GGCAGCGTCG AAACGGAACG CCTGTTCAGC 2400 

AGGGGTGAGC GGGAACGCGT cCTCCCTCGT GTCACGCGCG GACTCAGGCG CCGGCTGCTC 2460 

CTTCTGTGCA GAAGTCGCTG ACACCGCTTC TGAAACGGCA GAAACATCTA CCCCCTGCTG 2520 

CTGTGCC C AG AGCTCAAAGG ACACATGCAA AAAAACATGC ACCGAAAGGA ACAGCGTcGC 2580 

GCACCGsGGT ACGATCCGAA GGgAGcAATT AAACGTCCGC AATACGTTCT CCCGGCGAGA 2640 

GAATTTCCGT AACACGCACC CCAAAGTTTT CATCAATAAC CACCACCTCT CCTTTTGCGA 2700 

TCAACTTGTG ATTG AC C AAA ATATCAACAG GTTCACCGGC AAGCTTATCC AACTCGATAA 2760 

TGTGGCCTTC CCCCATACCC AGGATATCTT TAATCATCAT GCGTGTACGC CCGAGCTCAA 2820 

CGGTAACTTC CATGAACACG TCCATGATAA GCCCGATATT TCCCTGTTCT GCGCCACCTG 2880 

ctGCATTCTG CAGCGGATGA AACTGGACTG ACTGCACACT CGGACTCGCG GCGCCTATCC 2940 

CCATCTGCAT GTTCACGCCC CCCATTTGAG AATTGCCCAC CTGgCtGCGG GCcTGCATTG 3000 

CCCCCCCCAT CCTCTCGATA ATTCGAACCA TCAGCTGCTC AGACACCAAC TCCCACAGCG 3060 

TATACGAAGT GCCATCTAGC TCCACCGTAT AGGTAAAAAC GCACAGACGC TGCGGGGGAA 312 0 

AGCGAACCAT CGCCTTAGGC ACCTGCACCG ACTCTGCAGG AGCCACACTT ACATTCTGTA 3180 

CGTTCCGCGC CTCAAGCGTA GAAAGCTGTG CGCTGACATA TTGGGTGATC GTTTCACTAA 3240 

CAACCGAAAG TCCCATATCA TCAATTTGAT CGTTGTCCTC ATGACTGACC AAATTGACGA 3300 

GTTTCTGCGC AAACTCAGGA GCCATGAGGA ACAAATGGTC CCCTGnAAAn TCTCCTTCAA 3360 

AATCGATGAC AgTTGCCACT AACATGTCCG GAATGACGCG GGAAAACTnT TCCTTAGAGG 342 0 

AAATTTCCAC ACGCGGCGGG GAAATAGAAA CAnTCTTACC GGTCAAAGAn TCCAAGCTCG 3480 

GGCAAAAGGA ACCCACATTC GCCTGACAGA AAGACTGCAA CAACTCGCTT TGTGCGCTGG 3540 

AGAGCCCCCC ACCGGAGAAA GACGCGCCCG CAGCGGGGGA GTCGCCGGCT CCCATCTCAA 3600 

CACCTGAAAG CAGGGCATCG ATTTCAGCCT GAGAAATAGA GCCGTCACTC ATACAATTCC 3660 

TCCTCGTCCG CGGATAATTC CTCAAAATCC TCTTGGGAGG TACTTTCTAT TCGTTCCAAA 3720 

ATCTGCGCGG CAATCTTTTT TCCCACCACC CCAGGcTGrs AsrrAAACTT CTTGCGGTTC 3780 
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CCAATACTGA GCACAAAAGG ATCGCCCACA TGGGTGTCGT GCAACCGGAT 
ACCCGGAGCC CAAGGATATC GCGCACTGAA AGGCGGAGCG ACCCAACTTC 
TCCATATCCA CCGTGGATAG CTTGTCGCGC AGAACCCCCA TGTAtGCGTG 
TGCGCACCGA AGAAAACCAA AACTGACTCG ACAACTTAGA AATGATAGGT 
TGTACGGAAT GCAAAAGTTC ATCATCCCCT CTTCCTCACC TACCTTTGTC 
CCAACACCAC CATCTCTGAG GGAGGGACGA TCTGCGCGAA TTGCGGGTTC 
GACCCAGGCG CGGACGCAGA TCGATAACcT GCGTCCAGGA TTCACGCACA 
TACGGACGAT GACCCCTTCC ATTACTGAAT TTTCAATATC AGTCAAATCC 
TGGCTGCCTG TCCTGTTCCT CCAAAGAGGC GGTCAATGAT AGAAAAAGTA 
CCACCTCAAG CAtGCGTTCC CTTTGAGCGG ATCCATAGTG ATCACCGCAA 
GGTGGGAATA GAACGGATAA AC TCCTCGTA CGTGAGCTGA TCTACCGACG 
GTGCAC CAT A CTGCGCAGTG cGCCGACAGC GAGGTAGTAG TCAACCGCGC 
TGCATCAACG ACAGTGTACG CATCTGCTCC TTTGAAAACT TATCTGGGCG 
TAGAGCGTAA TCTT a CGGGT GTCGCTGATA GGGCGCGCAT CTTCAATACT 
GAaCTGAtAG CC GTTAgC AG CTGAnT 
(2) INFORMATION FOR SEQ ID NO: 79: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11191 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



PC1M9&/13041 

GATATCCCCC 3840 

TGCCACCACA 3 900 

GTAGAACTCC 3960 

TCTATGGTGA 4020 

TCGAGCGTCA 4080 

GTTTCAATTT 4140 

TTCGCCAGAA 4200 

CGCTGCACCT 4260 

ATGGAGGGAT 4320 

GCGTAGAAGG 43 80 

CAACGTGCAC 4440 

AAAAGTCTCA 4500 
C CTAAAATC A 4560 
TGmATCCCCA 4620 
4646 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79: 
ATGGAGTAAT GAGCAGTTTA CCCAGTATCT TGAATATCTT TTTCGGGTAC 
TGCGCATACG GTTTCTGCCT ATGCGCGCGA CTTGAATCTT TTTGAACGCT 
CGCGCAGAGA GCGTGCGCGC GCGTAACAGT TTCTGATATG CGTCTGTTTG 
AGGAAGACGG GGACTTTCCG CAGCGAGTAT TAACCGAGTT TTGTCCGTGG 
TTATGTGTTT GCTAAAAAAA AACATTGGTG CGCGGACAAT CCTGCACGCT 
TATAAAAGGT CCTTCAAAGT TGCCTCGTTT TATGTTTCCA CCGCAAGCAA 
CACCTTACCA AGTCGTACGG ATATTTTGTG GCAGGAACGG GATGCGGCAC 
GTTGTATTCA ACAGGATGTC GCGTTTCAGA GATAGCGGCG CTCTCATTGA 



GCAGGCTGTC 
GGTTGCAACA 
TGTGTGAGTT 
TGCGAGGTTT 
TAGTGAGGAA 
AGGCGTTTTA 
TTTTTGCGAT 
AAGATGTGCA 



60 
120 
180 
240 
300 
360 
420 
480 
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TCCGCATCTT AGTTCTGCGA TTGTGCGGGG AAAGGGTGAT CGGGAsCGGA CCGTGTTTAT 
TGCTCCGTTT GCGCAGAATT TTTTGCACGT GTATATGCAG GCGCGTGCGC AcGAnTGTGC 
GCGcTACGCC TCTTGCACAC CCGCGCTGTT TGTGAATCAG CGGGGTCGGT CGCTTTCTGT 
GCGCGGAATA CAGTACCTTG TTAGTCGGTA CGTGCTTTTG GCCCAGGACG TGCACGCGCT 
GTCTCCCCAC GCGTTTCGGC ACAGTTTTGC TTCGACGTTG ATCCGTCGGG GGGCTGATGT 
GCgCGTTGCg CAAGAGTTAT TAGGACATGC GAGTGTGTCT ACCACCCAGC GATATGTGCA 
TGTGACTTCA GAGCAACTGC AGGACTTGTA TCACCGTGCG CATCCGCGTG GATAGGGGGT 
AGGAACGGAG CGTCCAAACG ATGCGGGGAA GCGAGCTGCA GAGAATGTAC ACCAGTGCGA 
AGTGCTTTTT TCTGAGACTT TTTTGAAGAA GACTTTCTTA AGCTCGCTTT TTTTTGGTCG 
ACAATGGGTC GGGGGTAGTC GGATGAATAG TTTTACCAGA ACGGTGGATC TTTTGCATCG 
TGCTTTGGAT GTCAACgcGT TGCGCTATGA AGTGACGGCG AATAATCTTG CGAACGCAGA 
GGTTCCAGGG TTCAAGCGGA CGGACGTAAA CTTTGAAGCA GAGCTCAAGC GTGCTCTGGA 
TTCTCAAAGA AATGAGACAA GTTTTTTCAA GCAGGCAACT GCGGGGACGA ATATGTTGTC 
CAGTGATGTT ATCGACTAcC GcTCGGTGCG TCCGCGCCGC GTGTTAGACT ATTTGACGGA 
TGTGAAGGCG AACGGAAACA ATGTGGATGC TGAGCAAGAA GCCATGCATG TTCTCAAGAT 
TCAGATGCAC TATCAGATGT TGAGTCAGaT GGTAGGGTTC CAGTATCGTC AGGTTGAGTC 
CGTGTTACGT TAAGCGTATG GAGAAGCGTG ATGGGTTTGT TTAGTGGTAT CAATATTGCC 
GCGACGGGTA TGAGCGCGCA nTTTGGCGGG CCGATGTGAT CTCTGACAAC ATTGCTAATG 
CTTCCTCCAC GAGGACTCAA GAAGGTGGAG TGTTTCGGAG GAGCAGGGTA GTTTTGGCGC 
AGAAGAATCC TGGCATTGAC TGGCGTATAC CTTTTGTGCC CGAGCAGTTG GATCGGGGGG 
TAGGCACAGG GGTTCGTGTG GTAAGCATAG AAAAGGACAA CGCTCCTTCT CGTCTTGTGT 
ACGACCCAAC GCACCCTGgA TGCGATTCTA TCAGGGCCGA AGtGGGgTAC GTGGAGTATC 
CtAACGTGGA TATTGTGACA GAGATGGTGG ATCTTATTTC TGCCTCTCGC GCGTATGAGG 
CAAACATATC AGTTATTTCA GGATCAAAAG AAAaTGTTTC AGCGTGCGTT GGAGATTGCG 
CGCTAGGTGT GTTGCGCGTA CAgTCTGTGA AGATGTCTGT GCTGTGTGAG GGGAGGATAC 
AATGACGCCA GTTGGTACCA TTACGAATAG TGCGAATGTA TATAAAGTTC CATCTCTGAG 
GAAGGTGCCT GAAATCGGTC CAGTGTCGGT AGAAAGCGTA AGGcAGCGCA TGCGAGGGAA 
TACTGACGCG GTGGATCAGG CAGTGAACAA AAAGGCGATG AGTTTTGAGC AAACGTTGCT 
GCGCGCTTTT GATCAGGTAA ATCAAAAGCA GCAGAAGACT GCTGAGTTGA CCGAGCAAAT 



540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
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GATAGTAGAT CCTGAGTCTG TTGACGTGCA TGATGTAACA GTGGCGATGG CGGAGGCTAG 22 80 

TATGTCCTTG AAAATCGCGC AGACTGTCAT TGATAAAGTC CTTAAGAGCT GGAACGATGT 2340 

CACCACTGCT CGGTAAGGTT TACAAGGCCG GGCTGTTCTG CAAAAAGAGT ACCGACGGTA 2400 

TATCAGgTGA AAAGAGGGTG GGACGCGCTT AGTGCGCATT GGCTCGTTCT ATAGTGAGGG 2460 

GAGGGGACAC GCGTGGGCGA ATGGTTGGGG CAGCTCGGAG TCAAACTCAA AACACAGTGG 2520 

AAGAAGTGGA CGCTCGTGCA GAAGTCTGTG CTTGCCGGCG CGGCGCTCGT GTCTGTCATG 2580 

GGGGTTGTTG TCTTGCTCAC GTgGtCGcGA AGCCGACkcT CGTGCCACTT ATCGACACTC 2 640 

CTATCACTGA TGAGACGGTG CGGGAAAAGA TTATCCTGCG CCTTAACGAA GAGAATGTGC 2700 

GTGCAACCGT CTCAAGCGTT GGGTTGATTT CTGTCTCGGA TGAGAAGACA GCGCGTCGTA 27 60 

TGCGCAGCAT CTTAATTCGC GAAGATTTGA TCCCAAAAAA TGTGGACCCA TGGGCCATAT 2820 

TCGACGTCGA GCGATGGACG CGTACTGACT TTGAGCGCAG GGTGGACGTG CGGCGTGCAA 2880 

TTAATAATAC CGTTACCAAT CATATCAAAG CGCTCGACGA CATCGATGAT GCCCATGTAG 2940 

TAATAAACGT GCCTGAGGAT GCGCTTTTTC AGGCAGACCA GAAAC C T ATT ACTGCGAGCG 3000 

TTGTCATTTT CCCTAAACCG TCGAGCACGA TCGCCTCAGA AAGAAAAAAA ATAGAAGGCA 3060 

TTCAGAAACT ATTAAAGCTT GCAGTTCCTG GACTGAAGGA TGAAAACATC ACGATTGTAG 3120 

ATAGTGATGC TACCGTCCTA AATGATTTTG AAGGGTTCAA GGACGCTGAT CGGCTGAGTC 3180 

TCATTGAAAA GC AAC AG AAA ATGATTGCGA arCTGgAATC CCAGTATGAG GCAAAAGTGC 3240 

TGGCTCTCTT GCAAAAGACG TACGGTAAAG ACCGGGTGCG CGACTTAAAT ATCAAAATTG 3300 

AAATGGATCT TTCTGAAAAG ACGTCGCAGA tACCAAGTAT CTGCCTATAG AAATCCGTCA 33 60 

GGACAATCCG GATACCCCGT GGGATGATTC TCAGGTTGTG CCCTCTGTCA CTTCGATATC 3420 

TGAAACGGCm ACcAC tACGT GGnCAGGGTA CGGGGCTTAA CCCTGAAGGA CCGCCGGGAG 3480 

TTGAGGGTCA AACACCTCCT GCATACAAAG ACATGAGCAA CCAGGTGGGA CTTTCTAACC 3540 

AGTCGGTCGT TAAGAAGCAA GAGGCGATTA GCAAGAGTGA GATCAACGAA GTAGTGAGCC 3600 

CGGTGCTCGG CCGCAGGACG GTGTCGGTCA ATATCGATGG AGAATGGCGC AAAAAGAGAG 3660 

ACGAGCACGG AAGATTCATT GTGAAGGAAG GACACATTGA ACGTGAGTAT ATCCCCATCT 3720 

CTGnTGAGGA GCTGCGGGAG GCAACGAAGG CAGTGCAGGA TGCAATCGGC TTTGATGCGG 3780 

GGCGTAAGGA TTCGGTAAGT GTTTTAAATA TCAAATTTGA CCGGACGTCA GAATTTGATA 3840 

GAGAAGATGA GCATTACCTG CGCGTCCAGC AGAGGAACAT GATCaTCtTa TACTCCCtTg 3900 

CcAGtgtgGC AATCGTTTTA TTTATCTTCA TGGTATACAA GGTTATCAGC AAAGAGGTGG 3960 
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AGCGTCGCCG TCGTCTGCGG GaAaGGAGCT TTTAAGGCAG CAGCAACTGA TGAGGGAGCG 
TGCCCTGTGG GAGGCTGAAC AGGCGGGGAT GAATGTTTCC ATGTCGGTGG AAGAGCGTAA 
GGnCTTGAAT TGCAAGAGAA TGTGTTGAAT ATGGCGCGGG AGCATCCGGA AG AG TTGCGT 
TGCTTGTGAG AACGTGGTTG ATGGAGGAGT AGTACTATGG CCGTTACATC CGTGAAGGAT 
AAGCTCGCCA CGGGAGAAAA AAAGCAACGG GATATCAAGT CTCTCAATGG TCGGCAAAAG 
GCAGCGATAT TTCTAGTTTC TATTGGGGAG GAAATATCCG CTAAGGTCAT GGGAGAACTT 
AAGGAAGACG AGATTGAAAA GTTGGTGTTT GAAATAGCGC GTACAGAGTC aGTTGATGCA 
GAACTCAAGG ATGCagTTTT AGAAGAaTTC CAGGAACtGA TGACCGCACA AAACTTTATC 
ACCTCAGGAG GTATCGATTA CGCGCGGGGA TtGTTGGAGA AGTCGTTGGG AAGTCAAAAA 
GCAATCGAGA TCATAAATCG GCTGACAAgC TCCTTGCAGG TGCGTCCCTT TGACTTTATT 
CGCAGAACTG ATCCCACACA CCTGTTAAAT TTTATTCAGC AAGAGCATCC GCAGACAATT 
GCGCTTATTT TGGCGTACCT TGAGCCGAAT AAAGCTTCTG TTATTTTGCA GAACCTCCCT 
GATGAGATTC AGAGTGATGT GGCTCGGCGC ATAGCCACGA TGGATCGGAC GTCCCCTGAT 
GTGTTGCGCG AGGTTGAACG AGTACTTGAG AAAAAATTGT CAACGCTTTC TAGCGAGGAT 
TATACGGCCG CAGGAGGTGT CCAGAACATC GTGGaCATCT TGAATTTGGT CGATCGTTCT 
TCTGAAAAAT CTATTGTTGA AGCATTGGAA GATGAAGATC CAGATCTTGC AGAGGAAATT 
AAAAAACGTA TGTTCGTGTT TGAGGATATT GTAATGCTCG ACGATCGGGC CATTCAAAAG 
GTGCTGCGGG AGGTGAATAT GGAAGAACTC GCAAAGGCAC TCAAGGTTGT CGACACTGAA 
GTACAAGATA AAATTTTTAG GAATATGTCG AAGCGGGCAG GGAGTATGCT GAAGGAAGAA 
ATGGAATACA TGGGGCCGAC CCGCTTGAAA GATGTGGAGG AAGCCCAGCA GAAGGTTGTT 
TCTATCATCA GACACCTTGA AGATAGTGGT GACATTGTCA TCGCGCGTTC AGAAGAAGAC 
GAG ATGa TTG TGTAAATGTT GTTCCTGATA AGCGATATGG GGTTCGAAAG GAAGCAGACA 
GTATGCCAAA GmTsATATTT CGGAACCATG AAGTGAAGAA TCTTGATCAG TTCTTGCTGC 
TTGATCTGAG CAGGTCTTTT GGTGTCGAGC CTCAGATTGA GGAGGTGCAA AGCGAACCTG 
TGTGTCCAGT TCCTGATATG CGTGAAGTGC AAGAGGAAGT TGAGCTGTTT CGAAAAAGTT 
GGGAAGAAGA GCAGGTGCAG CTGCGCGCGC GTGCAGAGCG TGAGGCACAA GATCTAAAGG 
AGCGTGTAGA GGAGGAAATC ACAGCATATC GCGAACAGTG TACGCAGGAG GCGGATCGTA 
TCCTTGCTCA GGCAAAGGAA CAGTCTGAGC TACAAATTAG CGAGGCGCAA CAGCAAGCTG 
AACGCATGAT TGCTGAGGCA GAGACGTCTC GTCAGAAAAT ATGTGATCAC AGTAAGGCAG 
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AAGGTATTCG TCTTGGCAAG GAAGAAGGGT TTCGTGCGGG ACAGGAAGAG GTGCGGTATT 57 60 

TAACTGAGCG TTTGCATAAG ATGATCGAAG AAGTGATGGG GCGGCGTCAG GGTATTTTGC 5820 

GGGAAACCGA AAGACAGATT GTTGATCTGG TGTTGTTGAT GACAAGGAAG GTGGTCAAGG 5880 

TCATTTCTGA AAACCAACGC GCTGTTATCA GCGCAAATGT GGTGCATGCG TTGCGTAAGG 5940 

TGCGAACGCG CGGAGCGgTG ACGCTGcGGG TAAACCTTGC GGATGTGGAG CTTGTTACCC 6000 

AGCACAAGCA GGAGTTTATC GCTGCAGTGG AGCGTGTGGA TGATCTAACG GTAGTGGAGG 6060 

ACACGTCAGT GGGTAGGGGC GGTTGCgTGG TGGAAACGGA TTTTGGAGAG ATTGACGCGC 6120 

GGGTTGCAAG TCAGCTCCAT GaGCTTGAGC AGCGTGTTTT GGAAGTTGCC CCCATTGTAG 6180 

TGTCATCAAT GTCAGCATCT AAGGGTTCTT GATAGAGAAA GAGGCGTGGG TGTGCGTGTA 6240 

TGGAAGCAGA CCTGTTGTGC AAGTATGAGG TGGCgCTCCG CGAGAGTGAG CCGGTAAAGT 6300 

ACGTTGGGCA TGTGACAGCA GTGAGGGGTT TATTGATTGA AAGTCGTGGC CCTCACGCGG 63 60 

TAGTTGGTGA ATTGTGTCGG ATTGTGTTGC GCCGCCAGGG GCGACCGTTG ATAGCAGAGG 6420 

TAGTAGGACT TGCaGGATCG ACGGTAAAAC TGATGAGCTA CACCGATACG CACGGGGTTG 6480 

AAGTTGGCTG TGCGGTGGTA GCAGAAGGGG CGGCAtTTCA GTCCCCGTAG GAGATGCTTT 6540 

ACTCGGAcGC GTTTTGAACG CGTTTGGGAA GGCAATTGAC GGGAAGGGGG AGATATATGC 6600 

cgTCCTCCGC TCCGAGGTGT TGCGCGCGTC TTCTAATCCT ATGGAGCGTC TTC CGATT AC 6660 

GCGTCAAATG GTAACAGGAG TGCGGGTGCT TGATTCkTtG CTGGCAGTTG GTTGCGGACA 6720 

ACGTCTGGGT ATTTTTTCCG GTTCGGGGGT TGGGAAGTCG ACGCTGATGG GGATGATCGC 6780 

GCGCAATACA GACGcAGATG TGTCGGTCAT TGCCCTTATC GGGGAGCGTG GCCGTGAAGT 6840 

GATGGATTTT GTTGCGCATG ATTTGGGTCC TGAGGGTTTG AAGCGCTCGG TAATAGTTAG 6900 

TGCGACGTCT GATGAAAnGT CCTTGCGCGG GTACGAGGTG CGTACACGGC GACAGCGATT 6960 

GCAGAGTACT TTCGGGATCA AGGCAAACAG GTGCTGCTGC TGTTTGATTC TCTGACGCGC 7020 

TTTGCAAAAG CTCAGCGTGA GATTGGGTTA GCGTCGGGGG AGCTCCCTGC AACGCGTGGA 7080 

TATACCCCGG GGGTATTCGA AACGTTACCG AAACTGCTTG AGCGTGCAGG TTCTTTTTCC 7140 

ATGGGGAGCG TCACCGCTTT TT AT AC TG TT TTGGTAGATG GGGACGATCT CGATGAGCCG 7200 

ATATCAGACG CCGTGCGTGG AATTGTAGAC GGGCACATTG TACTCAGTCG CGCGCTTGCg 7260 

cAGcgCAATC ACTATCCTGC AATAGACGTG TTGCAAAGCG TTTCTCGCTT GGCGCACCGC 7320 

GTGCTGGGTG CAGACATGAA AGAGGCAGTG CGCATAGTGC GTCGTGCGCT TGCAGTGTAC 7380 

GCAGAAGTAG AGGATTTGGT ACGAGTTGGT GCGTACCAGC AGGGGAGTGA TGCAGAACTT 7440 
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GATCGAGCTA TTGCGATGCG CGCAGAGCTT GAACGGTTCC TAACGCAAGG AGCCCAGGAG 
CGCGTGCGTT TTCAGGATAC TGTAACGTCG CTGTCCATGC TGACAGGGCT CAGTATAGCA 
CAGCCGCCTT CGGGTGTGTG AATCTGCAAG AGCAGAGGAG ATAGCGCGTG TGAAAAGGTT 
TTGTTTTTCT CTTGAGCGTG TGCGACGCTT GAGAGCGTTT CGTGTACGCG AGCTGGAAGT 
TGAGTTAAGC AAAGTTCTTG CAGAATACGG AAGCATAGAT ACACAGATTC GATCGATTGC 
TGGCGAGTAT CGTGCGCGGA TGCAGGACGT AGCGCCAAAG CGTGGAGCAG TTTTTTCTGC 
TGCGTCGGTG AGCGCTGTGC AGGATCAAAT TGACGTGTTG CAATTACGCC GAGAACAGCT 
GCTCCATAAG CAGGCGCACC TTTCTTTTAC TCTTGAGCAA TTGCGAGAAC GATACGCGCA 
CGnGCGCCGT GCACACGAGG CTTTGCTCAT GCTTGAAGAA AAGGAGAAAA CACGCTGGCG 
AGAGCAGCGA CTGCGCGCTG AGGACCGAGC GTGTGACGAC CTGGTCAGCG CACGCGTaCC 
TGGTGCACCC AGCAAGCATT AATGGCTGGC GCGCTGCGTG CGCGCTcGGG TGTATGAGGA 
AGGCGGTCCA TGTCCGTGGA AGAGTATGAG CGTTTCGTGT GCCGTGCACG CTCGTTCCAA 
GATGGTGTCT GCCTCATTTC CCGCTTCTTC GTACCCTGCA GAACACAGAT CCCCCGTGAA 
CGCAAGGTGT GCAATACGGT ATAGGTAACA GCCATACGCA GGGGATGCAA AACAGGTAAC 
GGTAAAACCT GCGCAATTGA GGGAACAGTC TCCCTGTAAC AGGGTGCTCT CGTGGGCATT 
CCAGCACTTT TCCCCGCAAA AGATGTGCGG GGAGTATACG CGCAAAAGCG TGCGCACGCC 
CGCTTGGGTT GCGTCATCTG TGCGGGTAAG AAAAACAGCC GTCAGGGTGC AGGAATTCTT 
TTCAATGGTG TGGATCAAGT GAGTGCTCAC GTGCCCGGCA TCGATTAGGA GCGCTTCGTT 
TAAGGACAGG TTGCACAGCA GATAACTCGT GCGCTGGGTA TGCTGCTCTG CAGGGTGTAT 
GTAAACTTTC ATGGCAGGTC GCTCAGGTAT ATGCGGGCGT GTTCGGTACT AGATCCTTTG 
AATGTGCaGT GCGTAGTGTG CGAGGGCATA AAGTGCGTGT GTTGCACGGT GCAATTTTCA 
AAACGCACTG TTGCAAGCGT AGCGTGTAAG AAACGGGTGT GAGAAAGATC GCTCTCCTGA 
AAAGAAACAT CCTGCGCGTG GATACGGTTG AAGTTGTCAG ATAAGAGTAC TGCATGTGCG 
AAGCTCACCG TGTGCAGGAT GCTCCCCTCA AAAGAGGAGA ACTGAATGTC AGCTTCAGTG 
AAAGAGCTAT TGAC CAGGTA GCAACGCTCA AAAAAACACA TACGCATCAC TACATTCGCG 
AACACACATG CATTAAATAC GGTATTAAAA AAATTGCAGC CGACGAAATG AAAACCCGAA 
AAGTCTACGC GGTTCAGGCG CATGCGTGGA GCGTACACGC CGTGTATGTG TGTTGACGTA 
TGCATGAGCT TAGCAAAGTC CGACGTAGAG CAATACGGTG TGGGTTCGAA CATGCGCGCA 
AAGCTTATAC GGGCTGGACG GGTGTGGTGT CAATGTATGC GGTGTGCAGG ACCTGGGGAT 
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